finding
active
finding:tem-t-learns-band-cell-like-position-encoding-representations-resembling-krupic-et-al-band-cellsTEM-t learns band-cell-like position encoding representations resembling Krupic et al. band cells
Empirical result showing TEM-t position encodings also recapitulate band cells, not just grid cells.
Source paper
extracted_from(2021) · James C. R. Whittington · Joseph W. Warren · Timothy E.J. Behrens
Neighborhood — ranked by edge-count
Claims (1)
claim
- TEM memory retrieval is mathematically equivalent to transformer self-attention without softmaxsupportsCentral theoretical claim: a single step of TEM attractor dynamics equals a dot-product attention, making TEM a special case of transformer.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Empirical result showing TEM-t recapitulates entorhinal grid cell representations with linear post-transition activation.
- Empirical extension showing grid cell learning generalises to non-4-connected spatial environments.
- Novel interpretive claim about position encodings inspired by the TEM-transformer correspondence.
- TEM-t memory neurons show spatially-tuned firing resembling hippocampal place cells in each environmentfinding0.780Empirical result demonstrating that the sparse softmax activation of memory neurons produces place-cell-like spatial tuning.
- Theoretical claim linking the TEM-t architecture to the Teyler-Rudy hippocampal indexing theory.
- Neural Representations of Location Composed of Spatially Periodic Bands (Krupic et al., 2012)concept0.775Discovery of band cells; TEM-t also recapitulates these representations.
- TEM's path-integration representation g plays the role of position encodings in transformersclaim0.759Key structural correspondence claim linking the neuroscience model's spatial representation to ML concept of position encoding.
- Mechanism for encoding sequence order in transformers; paper argues these should reflect learned structural representations rather than fixed sines/cosines.