artifact
active
artifact:feature-visualization-interface-transformer-circuits-pub-2023-monosemantic-features-vis

Feature Visualization Interface (transformer-circuits.pub/2023/monosemantic-features/vis/)

Interactive interface for exploring all 90 learned dictionaries' features, including activating examples, logit effects, and ablations

Neighborhood — ranked by edge-count

Frameworks (1)

framework