finding
active
finding:pca-analysis-shows-token-embeddings-and-unembeddings-are-concentrated-in-a-relatively-small-fraction-of-residual-stream-dimensions-in-large-models

PCA analysis shows token embeddings and unembeddings are concentrated in a relatively small fraction of residual stream dimensions in large models

Supporting evidence for the claim that most residual stream dimensions are free for other layers to use

Source paper

extracted_from
A Mathematical Framework for Transformer Circuits
(2021) ·

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.