claim
active
claim:some-mlp-neurons-and-attention-heads-perform-memory-management-by-reading-residual-stream-information-and-writing-its-negative-to-delete-it

Some MLP neurons and attention heads perform memory management by reading residual stream information and writing its negative to delete it

Hypothesis based on observed negative cosine similarity between input and output weights of some neurons

Source paper

extracted_from
A Mathematical Framework for Transformer Circuits
(2021) ·

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.