finding
active
finding:a-small-group-of-hidden-states-group-b-over-end-of-sentence-punctuation-tokens-is-highly-causally-implicated-in-truth-judgments

A small group of hidden states (group b) over end-of-sentence punctuation tokens is highly causally implicated in truth judgments

Patching experiments localize truth representations to these specific hidden states in LLaMA-2 models

Neighborhood — ranked by edge-count

Claims (1)

claim

Hypotheses (1)

hypothesis

Concepts (1)

concept

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Restated by (1)

cosine ≥ 0.90

Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.