claim
active
claim:transformers-are-recurrent-through-autoregression-because-k-v-stream-provides-horizontal-information-flow-across-positions

Transformers are recurrent through autoregression because K/V stream provides horizontal information flow across positions.

Claim formalizing the Anima Labs idea that transformers are effectively recurrent due to K/V stream.

Neighborhood — ranked by edge-count

Communities (1)

community

Concepts (1)

concept
  • K/V Stream
    supports
    Proposed pathway flowing across positions at each layer; carries key, value, and attention-weighted information horizontally.

Artifacts (2)

artifact

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.