finding
active
finding:filler-gap-mechanism-in-pythia-1b-crosses-over-several-different-positions-before-arriving-at-output-positionFiller-gap mechanism in pythia-1b crosses over several different positions before arriving at output position
Mechanistic finding from CausalGym case study showing complex multi-step movement for filler-gap
Source paper
extracted_from(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
Neighborhood — ranked by edge-count
Claims (1)
claim
- Mechanistic interpretation of training dynamics in case studies
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Training dynamics finding showing filler-gap takes longer to learn than NPI licensing
- Mechanistic finding from CausalGym case study showing multi-step information movement in NPI mechanism
- NPI licensing mechanism in pythia-1b emerges in discrete stages (steps 1000, 2000, 3000) not graduallyfinding0.753Training dynamics finding showing abrupt rather than gradual emergence of NPI mechanism
- Main mechanistic finding from case studies; evidence from training checkpoint analysis of pythia-1b
- Temporary disruption of gap junctions causes planaria to reconstruct heads appropriate to other species, revealing latent morphospace attractors.
- Attributed to model anisotropy from saturation making hidden states harder to access
- Robustness check across seeds showing occasional failures of alignment map training
- Linguistic phenomenon where interrogatives extracted from a clause leave behind an empty gap; studied as case study in CausalGym