finding
active
finding:pythia-14m-achieves-only-0-38-accuracy-on-npi-ever-subj-relc-taskpythia-14m achieves only 0.38 accuracy on npi_ever_subj-relc task
Baseline accuracy showing small models fail on harder NPI licensing tasks
Source paper
extracted_from(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Attributed to model anisotropy from saturation making hidden states harder to access
- Probe achieves selectivity of 4.20 on pythia-410m, slightly exceeding DAS selectivity of 3.96finding0.796Key result showing that for models larger than pythia-70m, probe selectivity matches or exceeds DAS selectivity
- Linear probe achieves 100% classification accuracy for almost all components in Pythia-6.9B gender taskfinding0.793Demonstrates that linear probes can overestimate causal relevance; probes succeed on non-causally-relevant representations
- Baseline result confirming the model has fully learned the gender prediction task before probing
- Training progression result showing non-linear maps are uncorrelated with genuine task learning
- Task accuracy on CausalGym increases consistently with model scale from 0.62 (14M) to 0.89 (6.9B)finding0.745Scaling result showing larger pythia models perform better on CausalGym linguistic tasks
- DAS substantially outperforms brute-force search on MoNLI across all models.
- Mechanistic finding from CausalGym case study showing multi-step information movement in NPI mechanism