concept
active
concept:sparse-feature-circuits-discovering-and-editing-interpretable-causal-graphs-in-language-models-marks-et-al-2025

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models (Marks et al., 2025)

Cited as enabling precise behavioral control through SAE features, extending the same methodological line

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.