question
active
question:how-much-performance-in-points-of-loss-do-skip-trigram-bugs-cost-the-model-and-do-they-persist-in-larger-models

How much performance (in points of loss) do skip-trigram bugs cost the model, and do they persist in larger models?

Open question raised by the paper's identification of skip-trigram bugs as interpretability-visible failure modes

Source paper

extracted_from
A Mathematical Framework for Transformer Circuits
(2021) ·

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.