framework
active
framework:intrinsic-self-correction-via-linear-representations

Intrinsic Self-Correction via Linear Representations

Framework by Lee et al. explaining self-correction via linear latent concept directions, closely related prior work.

Neighborhood — ranked by edge-count

Thinkers (1)

thinker

Frameworks (1)

framework
  • The hypothesis that models internalize concepts as approximately linear directions in representation space; used to interpret MDS injection behavior

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.