quote
active
We reproduce the results in Meng et al. (2022)'s Figure 1 of locating early sites and late sites of factual associations in GPT2-XL in about 20 lines of pyvene code.
Load-bearing demonstration of pyvene's conciseness for complex replication tasks
Source paper
extracted_from · (2024) · Zhengxuan Wu · Atticus Geiger · Aryaman Arora · Jing Huang +4
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- Case Study I demonstrating pyvene can replicate a major interpretability result compactly
- Cited as causal intervention methodology precedent
- Cited as causal intervention methodology precedent for this paper's ablation approach
- GPT-2 implements at least one induction head using pointer arithmetic on positional embeddings rather than K-composition · hypothesis · 0.737 · Observation of an alternative induction head implementation algorithm in larger models with positional embeddings in the residual stream
- Forward-looking interpretive claim about the implications of recurrent position encodings for NLP research.
- Argues against instrumental convergence in GPT.
- Importance of recursive generation.
- Disambiguation exercise.