thinker
active
thinker:trenton-bricken Trenton Bricken
Toy models of superposition.
Authored: 2
Introduces: 0
Studies: 1
Affiliations: 1
Cited by: 4
Authored papers (2)
Studies (1)
Affiliations (1)
- Anthropic (institute)
Co-authors (12)
- Adam Jermyn (2 shared)
- Adly Templeton (2 shared)
- Joshua Batson (2 shared)
- Shan Carter (2 shared)
- Tom Henighan (2 shared)
- Adam Pearce (1 shared)
- Andy Jones (1 shared)
- Brian Chen (1 shared)
- C. Daniel Freeman (1 shared)
- Callum McDougall (1 shared)
- Chris Olah (1 shared)
- Christopher Olah (1 shared)
Their work is cited by (4)
- Endogenous Resistance to Activation Steering in Language Models (2× refs)
- Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders (2× refs)
- Quantitative Introspection in Language Models: Tracking Emotive States Across Conversation (1× refs)
- Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior (1× refs)
Other inbound relations (4)
- authored: Bricken et al. (2023) Toy models of superposition (artifact)
- cites: Quantitative Introspection in Language Models: Tracking Emotive States Across Conversation (paper)
- cites: Semantic Anchoring in LLMs: Thresholds, Transfer, and Geometric Correlates (artifact)
- mentions: Semantic Anchoring in LLMs: Thresholds, Transfer, and Geometric Correlates (artifact)
Recent mentions (7)
- papers-typed: mckenzie-2026-endogenous-resistance.md
- papers-typed: martorell-2026-quantitative-introspection.md
- papers: towards.md
- papers: chang-2025-guanyin.md
- papers: scaling.md
- papers-typed: chang-2025-guanyin.md
- papers-typed: chang-2025-guanyin.md