finding
active
finding:pyvene-reproduces-meng-et-al-2022-figure-1-factual-association-localization-in-gpt2-xl-in-about-20-lines-of-codepyvene reproduces Meng et al. 2022 Figure 1 (factual association localization in GPT2-XL) in about 20 lines of code
Case Study I demonstrating pyvene can replicate a major interpretability result compactly
Source paper
extracted_from(2024) · Zhengxuan Wu · Atticus Geiger · Aryaman Arora · Jing Huang +4
Neighborhood — ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- Core design claim of the pyvene paper summarizing its contribution over existing tools
Questions (1)
question
- Core question motivating interchange intervention and interpretability research supported by pyvene
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Load-bearing demonstration of pyvene's conciseness for complex replication tasks
- Cited as causal intervention methodology precedent
- GPT-2 implements at least one induction head using pointer arithmetic on positional embeddings rather than K-compositionhypothesis0.743Observation of an alternative induction head implementation algorithm in larger models with positional embeddings in the residual stream
- Cited as causal intervention methodology precedent for this paper's ablation approach
- Sentence localization accuracy reaches 88% at layer 2, α=5 vs. 10% chance in 10-way classificationfinding0.728Highest localization accuracy achieved, showing strong partial introspection for early-layer injections
- Claim about current practical feasibility and efficiency of 2-way associative implementations.
- Key improvement in cross-task generalization enabled by explicit instruction framing.
- DAS consistently finds the most causally-efficacious features across all pythia model sizes in CausalGymfinding0.722Main benchmark result showing DAS superiority over probing, diff-in-means, PCA, k-means, LDA, and random