finding
active
finding:filler-gap-dependency-mechanism-in-pythia-1b-emerges-in-two-discrete-stages-steps-2000-and-10k-not-graduallyFiller-gap dependency mechanism in pythia-1b emerges in two discrete stages (steps 2000 and 10K) not gradually
Training dynamics finding showing filler-gap takes longer to learn than NPI licensing
Source paper
extracted_from(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
Neighborhood — ranked by edge-count
Claims (1)
claim
- Main mechanistic finding from case studies; evidence from training checkpoint analysis of pythia-1b
Findings (1)
finding
- Training dynamics finding showing abrupt rather than gradual emergence of NPI mechanism
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Mechanistic finding from CausalGym case study showing complex multi-step movement for filler-gap
- Mechanistic finding from CausalGym case study showing multi-step information movement in NPI mechanism
- Mechanistic interpretation of training dynamics in case studies
- Linguistic phenomenon where interrogatives extracted from a clause leave behind an empty gap; studied as case study in CausalGym
- DAS consistently finds the most causally-efficacious features across all pythia model sizes in CausalGymfinding0.723Main benchmark result showing DAS superiority over probing, diff-in-means, PCA, k-means, LDA, and random
- Temporary disruption of gap junctions causes planaria to reconstruct heads appropriate to other species, revealing latent morphospace attractors.
- Concise definition of the core dynamic of living process.
- Surprising negative result for LDA despite being a supervised method