hypothesis
active
hypothesis:we-hypothesize-that-persistently-active-emotional-state-representations-exist-in-llms-but-may-be-missed-by-standard-probing-methods

We hypothesize that persistently active emotional state representations exist in LLMs but may be missed by standard probing methods.

Open hypothesis from the Anthropic paper that motivates this work

Source paper

extracted_from
Persistence and Introspection of Emotion Features
Scott Sauers · Imago · Janus · Antra Tessera

Neighborhood — ranked by edge-count

Claims (1)

claim

Concepts (1)

concept

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.