claim
active
claim:text-based-and-self-steered-emotionality-ratings-are-only-weakly-correlated-0-051-n-s-suggesting-they-measure-different-aspects-of-feature-emotionality

Text-based and self-steered emotionality ratings are only weakly correlated (ρ = +0.051, n.s.), suggesting they measure different aspects of feature emotionality.

Finding that the two evaluation modalities frequently diverge in their interpretation of the same SAE feature

Source paper

extracted_from
Persistence and Introspection of Emotion Features
Scott Sauers · Imago · Janus · Antra Tessera

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Restated by (1)

cosine ≥ 0.90

Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.