question
active
question:how-can-internal-features-be-linked-to-reliable-control-of-complex-behavior-level-semantic-attributes

how can internal features be linked to reliable control of complex, behavior-level semantic attributes?

Central challenge that the paper addresses.

Source paper

extracted_from
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
(2026) · Ruikang Zhang · Shuo Wang · Q. Su

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.