finding
active
finding:functional-faithfulness-intervening-on-a-specific-internal-feature-induces-coherent-and-predictable-shifts-across-multiple-linguistic-dimensions-aligned-with-the-target-semantic-attribute

Functional Faithfulness: intervening on a specific internal feature induces coherent and predictable shifts across multiple linguistic dimensions aligned with the target semantic attribute.

Empirical effect observed in feature intervention experiments.

Source paper

extracted_from
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
(2026) · Ruikang Zhang · Shuo Wang · Q. Su

Neighborhood — ranked by edge-count

Claims (1)

claim

Communities (2)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.