artifact
active
artifact:transformerlens

TransformerLens

Existing mechanistic interpretability library by Nanda and Bloom, cited as a prior tool with limited support for interventions

Neighborhood — ranked by edge-count

Thinkers (1)

thinker
  • Neel Nanda
    studies
    External commenter; resolved an apparent counterexample to the linear representation hypothesis

Claims (1)

claim