concept
active
concept:summarization-token-behaviorSummarization Token Behavior
Behavior where information about full clauses is encoded over clause-ending punctuation tokens in LLMs
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Summarization Behaviorrelated_toThe phenomenon where LLMs encode clause-level information over clause-ending punctuation tokens rather than the final content token
Findings (1)
finding
- Patching experiments localize truth representations to these specific hidden states in LLaMA-2 models
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- An attention algorithm recovered by VPD where the model attends to the immediately preceding token.
- Contrasts with 7B and 13B which show consistent summarization behavior; may complicate localization at 70B scale
- Feature that fires on a specific token only within a specific surrounding context (e.g., 'the' in physics vs 'the' in mathematics)
- A pair of query and key subcomponents distributed across attention heads performs previous-token behaviorfinding0.713VPD recovers an attention algorithm for attending to the previous token, distributed across multiple heads.
- Basic unit of LLM input/output: words, parts of words, punctuation marks, emojis
- The standard set of tokens that the functional token remains a part of.
- Features that fire on every instance of a single token; appear in small dictionaries as collapsed versions of many token-in-context features
- Token-level supervision enables models to learn functional-token invocation from reasoning contextclaim0.699ATLAS author's assertion that functional tokens optimized via standard cross-entropy loss learn when and how to invoke operations from surrounding text.