method
active
method:pca-analysis-of-token-embeddings-unembeddingsPCA Analysis of Token Embeddings/Unembeddings
PCA applied to token embedding and unembedding matrices to understand what fraction of residual stream dimensions they occupy and how they relate
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Supporting evidence for the claim that most residual stream dimensions are free for other layers to use
- Statistical method used to analyze neural activity data.
- Vector representations of individual tokens from genomic foundation models; the raw inputs to sequence pooling methods.
- Extracting embeddings from instruction and example spans.
- Used to visually inspect separation of truth-related directions in model activation space across layers
- PCA on 171 emotion probe activations across all tokens to produce ordered linear combinations and test if lower PCs are more persistent
- Used to visualize LLM true/false representations, revealing clear linear structure separating true from false statements
- Obtain instruction and example span embeddings at layer L* with chosen pooling.