Layer 18

Specific transformer layer housing the addition module.

Neighborhood — ranked by edge-count

paper

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

early layers (0–5)concept0.735
Layers with weak anchoring due to generic representations.
Layer Normalization (Ba et al., 2016b)concept0.712
Layer normalisation used in transformer and in TEM-t position encoding preprocessing.
Math/code tasks S ≈ -1.65 at layers 8–12finding0.709
Task-specific peak anchoring score for structured reasoning domains.
deeper layers (16–28)concept0.704
Layers where anchoring weakens systematically due to representational drift.
Layer sweepmethod0.693
Procedure of systematically varying the layer at which activations are recorded and injected.
Multi Layer Perceptronframework0.681
Network with hidden layers capable of representing non-linearly separable functions, enabling deep model induction
Bioelectric Control Layerconcept0.677
A key interface exploited by evolution to accomplish morphogenesis; cells perform computations via ion channel voltage dynamics; enables integration of information across scales toward large-scale morphogenetic goals.
Zero-Layer Transformerconcept0.675
A transformer with no attention layers; shown to model bigram statistics via T = W_U W_E