Linear algebraic structure of word senses, with applications to polysemy

BySanjeev Arora·Yuanzhi Li·Yingyu Liang·Tengyu Ma·Andrej Risteski

DOI 10.1162/tacl_a_00034 arXiv 1601.03764

Original abstract (expand)

Word embeddings are ubiquitous in NLP and information retrieval, but it is unclear what they represent when the word is polysemous. Here it is shown that multiple word senses reside in linear superposition within the word embedding and simple sparse coding can recover vectors that approximately capture the senses. The success of our approach, which applies to several embedding methods, is mathematically explained using a variant of the random walk on discourses model (Arora et al., 2016). A novel aspect of our technique is that each extracted word sense is accompanied by one of about 2000 “discourse atoms” that gives a succinct description of which other words co-occur with that word sense. Discourse atoms can be of independent interest, and make the method potentially more useful. Empirical tests are used to verify and support the theory.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Interpreting Neural Networks through the Polytope Lens
Lee Sharkey, Leo Grinsztajn, Eric Winsor, Dan Braun, Jacob Merizian, Kip Parker, Carlos Ram\'on Guevara, Beren Millidge, Gabriel Alfour, Connor Leahy Sid Black
2022
≈ 75%
Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings
Katharina Kann, Timothy J. Hazen, Eneko Agirre and Hinrich Sch\"utze Yadollah Yaghoobzadeh
2019
≈ 74%
General Mechanism of Evolution Shared by Proteins and Words
Hsing-Yi Lai, Sun-Ting Tsai, Chen Siang Ng, Kevin Sheng-Kai Ma, Shan-Jyun Wu, Meng-Xue Tsai, Yi-Ching Su, Daw-Wei Wang, and Tzay-Ming Hong Li-Min Wang
2026
≈ 74%
Simple Mechanisms for Representing, Indexing and Manipulating Concepts
Raghu Meka, Rina Panigrahy, Kulin Shah Yuanzhi Li
2026
≈ 73%
Polychrony as Chinampas
Jose Antonio Arciniega-Nevarez, Anh Nguyen, Yitong Zou, Luke Van Popering, Nathan Crock, Gordon Erlebacher, Jose L. Mendoza-Cortes Eric Dolores-Cuenca
2026
≈ 73%
The Wittgensteinian Representation Hypothesis: Is Language the Attractor of Multimodal Convergence?
Run Shao, Dongyue Wu, Jiajie Teng, Chao Tao, Jingdong Chen, Haifeng Li Zhaoyang Zhang
2026
≈ 73%
Disentangling Neuron Representations with Concept Vectors
Vincent Andrearczyk, Henning Muller, Mara Graziani Laura O'Mahony
2023
≈ 73%
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models
Philip Torr, Fazl Barez Michael Lan
2024
≈ 73%
An Encoding of Abstract Dialectical Frameworks into Higher-Order Logic
Alexander Steen Antoine Martina
2026
≈ 73%
Morphological Computing as Logic Underlying Cognition in Human, Animal, and Intelligent Machine
Gordana Dodig-Crnkovic
2023
≈ 73%
Why Linear Interpretability Works: Invariant Subspaces as a Result of Architectural Constraints
Yousung Lee, Dongsoo Har Andres Saurez
2026
≈ 72%
A Unified Theory of Sparse Dictionary Learning in Mechanistic Interpretability: Piecewise Biconvexity and Spurious Minima
Harshvardhan Saini, Zhaoqian Yao, Zheng Lin, Yizhen Liao, Jingyi Cui, Yisen Wang, Mengnan Du, Dianbo Liu Yiming Tang
2026
≈ 72%
The Geometry of Concepts: Sparse Autoencoder Feature Structure
Eric J. Michaud, David D. Baek, Joshua Engels, Xiaoqing Sun, Max Tegmark Yuxiao Li
2025
≈ 72%
Discrete Latent Structure in Neural Networks
Caio F. Corro, Nikita Nangia, Tsvetomila Mihaylova, Andr\'e F. T. Martins Vlad Niculae
2026
≈ 72%
Identity as Attractor: Geometric Evidence for Persistent Agent Architecture in LLM Activation Space
Vladimir Vasilenko
2026
≈ 72%
Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts
in corpus
2026
≈ 72%
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
in corpus
2023
≈ 72%
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
in corpus
2023
≈ 70%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 69%
Technological Approach to Mind Everywhere: An Experimentally-Grounded Framework for Understanding Diverse Bodies and Minds
in corpus
2022
≈ 69%
Multiple ways to implement and infer sentience
in corpus
≈ 69%
The biogenic approach to cognition
in corpus
2005
≈ 69%
Testing the Limits of Truth Directions in LLMs
in corpus
2026
≈ 69%
Information, Processes and Games
in corpus
≈ 69%
Learning without neurons in physical systems
in corpus
2022
≈ 69%
Model Alignment Search
in corpus
2025
≈ 69%
From Directions to Cones: Exploring Multidimensional Representations of Propositional Facts in LLMs
in corpus
2025
≈ 69%
The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?
in corpus
2025
≈ 69%

Similar preprints — Semantic Scholar

Cited by (3)

The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?
Under arbitrarily powerful alignment maps, causal abstraction becomes vacuous: any neural network can be perfectly mapped to any algorithm, a result proven formally in Theorem 1 under five mild assump
Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior
Manifold steering — intervening on model activations along paths constrained to lie on a learned activation manifold M_h rather than along Euclidean linear directions — produces behavioral trajectorie
Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts
Llama-3.1-8B solves cyclic arithmetic (e.g., "what month is six months after August?") not by performing modular addition in the period of the cyclic concept (12 for months, 7 for days of the week) as