claim
active
claim:the-transformer-likely-uses-a-local-code-for-token-in-context-features-rather-than-purely-compositional-representations-because-local-codes-enable-sharper-predictions

The transformer likely uses a local code for token-in-context features rather than purely compositional representations, because local codes enable sharper predictions

Authors argue the prevalence of token-in-context features reflects genuine model computation rather than dictionary learning artifact

Source paper

extracted_from
Towards Safe and Honest AI Agents with Neural Self-Other Overlap
(2024) · Marc Carauleanu · Michael Vaiana · Judd Rosenblatt · Cameron Berg +1

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.