concept
active
concept:functional-faithfulness

Functional Faithfulness

An empirical effect in which intervening on a single feature induces coherent shifts across multiple linguistic dimensions, each aligned with the target attribute.
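One way to make "coherent shifts" concrete is to compare per-dimension scores before and after an intervention and check that every dimension moves in the direction the target attribute predicts. The sketch below is illustrative only: the function name `coherent_shift`, the dimension names, the scores, and the `min_delta` tolerance are all assumptions, not part of this entry or of any particular library.

```python
def coherent_shift(baseline, steered, expected_sign, min_delta=0.0):
    """Compare per-dimension scores before and after an intervention.

    baseline, steered: dicts mapping a dimension name to a score.
    expected_sign: dict mapping a dimension name to +1 or -1, the direction
        the target attribute is assumed to push that dimension.
    min_delta: hypothetical tolerance; a shift must exceed it to count.

    Returns (shifts, aligned), where aligned is True only if every
    dimension moved in its expected direction by more than min_delta.
    """
    shifts = {dim: steered[dim] - baseline[dim] for dim in expected_sign}
    aligned = all(shifts[dim] * expected_sign[dim] > min_delta
                  for dim in expected_sign)
    return shifts, aligned


# Made-up scores for a hypothetical "formality" feature.
baseline = {"lexical": 0.40, "syntactic": 0.35, "contractions": 0.60}
steered = {"lexical": 0.70, "syntactic": 0.55, "contractions": 0.20}
expected = {"lexical": +1, "syntactic": +1, "contractions": -1}

shifts, aligned = coherent_shift(baseline, steered, expected)
print(aligned)
```

Under this reading, a single dimension moving against its expected direction is enough for the check to fail, which matches the requirement that the shifts be coherent rather than merely present.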

Neighborhood — ranked by edge-count

Concepts (2)

concept
  • faithfulness
    related_to
    The condition that commitments are fulfilled.
  • Bidirectional Steering
    associated_with
    Ability to steer model behavior in two opposite semantic directions on a trait.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
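
The filter described above (cosine ≥ 0.65, no typed edge) can be sketched as follows. This is a minimal illustration under stated assumptions: the embedding vectors, the `typed_edges` mapping, the candidate names, and the function names are hypothetical, and the entry does not say how the similarity scores are actually computed.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def similar_without_edge(target, embeddings, typed_edges, threshold=0.65):
    """List entities at or above the threshold that have no typed edge to target.

    embeddings: dict of entity name -> vector (assumed representation).
    typed_edges: dict of entity name -> set of names it already links to.
    Returns (name, score) pairs sorted by descending similarity.
    """
    linked = typed_edges.get(target, set())
    candidates = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue  # skip the entity itself and existing typed neighbors
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            candidates.append((name, score))
    return sorted(candidates, key=lambda pair: pair[1], reverse=True)


# Toy 3-dimensional vectors, invented for illustration.
embeddings = {
    "functional-faithfulness": [1.0, 0.0, 0.0],
    "faithfulness": [0.9, 0.1, 0.0],
    "Bidirectional Steering": [0.7, 0.7, 0.0],
    "candidate-a": [0.8, 0.6, 0.0],
    "candidate-b": [0.0, 1.0, 0.0],
}
typed_edges = {
    "functional-faithfulness": {"faithfulness", "Bidirectional Steering"},
}

result = similar_without_edge("functional-faithfulness", embeddings, typed_edges)
print([name for name, _ in result])
```

Entities that survive the filter are the ones worth reviewing by hand, either to add a typed edge or to merge as duplicates.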