claim
active
claim:mds-injections-align-with-the-linear-representation-hypothesis-target-trait-varies-near-linearly-with-alpha-in-open-ended-generation

MDS injections align with the Linear Representation Hypothesis: target trait varies near-linearly with alpha in open-ended generation

Theoretical alignment claim, supported by an ordinary least squares (OLS) analysis in which 96.15% of trends have R² ≥ 0.75
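To make the statistic above concrete, the following is a minimal sketch of how such a summary could be computed, assuming each "trend" is a list of alpha values paired with measured trait scores. The function names (`ols_r2`, `share_near_linear`) and the sample data are hypothetical illustrations, not taken from the paper, and the paper's actual scoring pipeline may differ.

```python
from statistics import mean


def ols_r2(alphas, scores):
    """R^2 of a simple OLS fit: score ~ intercept + slope * alpha.

    For a one-predictor fit with an intercept, R^2 equals the squared
    Pearson correlation between alpha and the score.
    """
    x_bar, y_bar = mean(alphas), mean(scores)
    sxx = sum((x - x_bar) ** 2 for x in alphas)
    syy = sum((y - y_bar) ** 2 for y in scores)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(alphas, scores))
    if sxx == 0 or syy == 0:
        # Degenerate trend (constant alpha or constant score): treated as
        # "no linear signal" here. This convention is an assumption.
        return 0.0
    return (sxy ** 2) / (sxx * syy)


def share_near_linear(trends, threshold=0.75):
    """Fraction of trends whose OLS R^2 meets the threshold."""
    r2_values = [ols_r2(alphas, scores) for alphas, scores in trends]
    return sum(r2 >= threshold for r2 in r2_values) / len(r2_values)


# Hypothetical data: one near-linear trend and one with no linear trend.
example_trends = [
    ([-2, -1, 0, 1, 2], [1.0, 2.1, 2.9, 4.2, 5.0]),
    ([-2, -1, 0, 1, 2], [3.0, 1.0, 4.0, 1.0, 3.0]),
]
share = share_near_linear(example_trends)
```

Under this reading, the reported 96.15% would correspond to `share_near_linear` returning roughly 0.9615 over the paper's full set of trends at a threshold of 0.75.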

Source paper

extracted_from
Psychological Steering of Large Language Models
(2026) · Leonardo Blas · Robin Jia · Emilio Ferrara

Neighborhood — ranked by edge-count

Findings (1)

finding

Frameworks (1)

framework
  • The hypothesis that models internalize concepts as approximately linear directions in representation space; used to interpret MDS injection behavior
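
As an illustration of why a linear direction would predict a near-linear response to alpha, the sketch below adds a scaled direction vector to a hidden state and measures the projection onto that direction. This is a generic additive-steering toy under the linear-direction assumption, not necessarily how MDS injections are computed; `inject`, `project`, and the vectors are hypothetical.

```python
def inject(hidden, direction, alpha):
    """Return hidden + alpha * direction, elementwise."""
    return [h + alpha * d for h, d in zip(hidden, direction)]


def project(hidden, direction):
    """Scalar projection of hidden onto the unit vector along direction."""
    norm = sum(d * d for d in direction) ** 0.5
    return sum(h * d for h, d in zip(hidden, direction)) / norm


# Hypothetical 2-d hidden state and concept direction (norm 5).
hidden_state = [1.0, 2.0]
concept_direction = [3.0, 4.0]

baseline = project(hidden_state, concept_direction)
shifts = [
    project(inject(hidden_state, concept_direction, a), concept_direction) - baseline
    for a in (0.0, 1.0, 2.0)
]
# Each shift equals alpha times the norm of the direction, so the projection
# is exactly linear in alpha in this idealized setting.
```

In a real model the measured trait passes through later nonlinear layers and a scoring step, which is one reason the claim is stated as near-linear rather than exactly linear.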

Methods (1)

method
