finding
active
finding:tem-t-with-linear-activations-learns-grid-cell-like-position-encoding-representations-in-2d-spatial-environmentsTEM-t with linear activations learns grid-cell-like position encoding representations in 2D spatial environments
Empirical result showing TEM-t recapitulates entorhinal grid cell representations with linear post-transition activation.
Source paper
extracted_from(2021) · James C. R. Whittington · Joseph W. Warren · Timothy E.J. Behrens
Neighborhood — ranked by edge-count
Claims (2)
claim
- TEM memory retrieval is mathematically equivalent to transformer self-attention without softmaxsupportsCentral theoretical claim: a single step of TEM attractor dynamics equals a dot-product attention, making TEM a special case of transformer.
- TEM's path-integration representation g plays the role of position encodings in transformerssupportsKey structural correspondence claim linking the neuroscience model's spatial representation to ML concept of position encoding.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- TEM-t learns band-cell-like position encoding representations resembling Krupic et al. band cellsfinding0.856Empirical result showing TEM-t position encodings also recapitulate band cells, not just grid cells.
- Empirical extension showing grid cell learning generalises to non-4-connected spatial environments.
- Theoretical claim linking the TEM-t architecture to the Teyler-Rudy hippocampal indexing theory.
- TEM-t memory neurons show spatially-tuned firing resembling hippocampal place cells in each environmentfinding0.804Empirical result demonstrating that the sparse softmax activation of memory neurons produces place-cell-like spatial tuning.
- RNN model recapitulating grid cells; related work category 4.
- Methodological validation result confirming the place-cell metric separates cell types in TEM-t.
- Forward-looking interpretive claim about the implications of recurrent position encodings for NLP research.
- Empirical computational efficiency result comparing TEM-t to the original TEM implementation.