finding
active
finding:our-method-enables-bidirectional-steering-of-model-behavior

Our method enables bidirectional steering of model behavior.

The method can steer the model in both positive and negative directions on the target semantic.

Source paper

extracted_from
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
(2026) · Ruikang Zhang · Shuo Wang · Q. Su

Neighborhood — ranked by edge-count

Claims (1)

claim

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.