Language models encode numbers using digit representations in base 10

ByAmit Arnold Levy·Mor Geva

DOI 10.18653/v1/2025.naacl-short.33

Original abstract (expand)

Amit Arnold Levy, Mor Geva. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers). 2025.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Understanding Counting Mechanisms in Large Language and Vision-Language Models
Amirmohammad Izadi, Fatemeh Askari, Mobin Bagherian, Sadegh Mohammadian, Mohammad Izadi, Mahdieh Soleymani Baghshah Hosein Hasani
2026
≈ 74%
Interpreting Language Models Through Concept Descriptions: A Survey
Laura Kopf Nils Feldhus
2026
≈ 71%
Mechanistic Understanding of Language Models in Syntactic Code Completion
Daking Rai, Ziyu Yao Samuel Miller
2025
≈ 71%
A Survey of Large Language Models
Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie and Ji-Rong Wen Wayne Xin Zhao
2026
≈ 71%
Visual Representations inside the Language Model
Amita Kamath, Madeleine Grunde-McLaughlin, Winson Han, Ranjay Krishna Benlin Liu
2025
≈ 71%
Across the Levels of Analysis: Explaining Predictive Processing in Humans Requires More Than Machine-Estimated Probabilities
Sathvik Nair and Colin Phillips
2026
≈ 71%
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models
Philip Torr, Fazl Barez Michael Lan
2024
≈ 71%
What do Language Models Learn and When? The Implicit Curriculum Hypothesis
Kaiser Sun, Millicent Li, Isabelle Lee, Lindia Tjuatja, Jen-tse Huang, Graham Neubig Emmy Liu
2026
≈ 71%
What do Transformers Know about Government?
Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu, Roman Yangarber Jue Hou
2024
≈ 70%
Exploring the Role of BERT Token Representations to Explain Sentence Probing Results
Ali Modarressi, Mohammad Taher Pilehvar Hosein Mohebbi
2021
≈ 70%
Entangled in Representations: Mechanistic Investigation of Cultural Biases in Large Language Models
Seogyeong Jeong, Siddhesh Pawar, Jisu Shin, Jiho Jin, Junho Myung, Alice Oh, Isabelle Augenstein Haeun Yu
2026
≈ 70%
Inspecting the concept knowledge graph encoded by modern language models
Marcelo Mendoza, Alvaro Soto Carlos Aspillaga
2021
≈ 70%
Mechanistic Interpretability of Code Correctness in LLMs via Sparse Autoencoders
Kriz Tahimic and Charibeth Cheng
2025
≈ 70%
Learning to Model the World with Language
Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan Jessy Lin
2024
≈ 70%
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models
Zeming Wei, Jun Sun, Meng Sun Yihao Zhang
2024
≈ 70%
Interpreting Language Model Parameters
in corpus
2026
≈ 69%
Paper Summary: Interpreting Language Model Parameters
in corpus
≈ 68%
Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts
in corpus
2026
≈ 68%
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
in corpus
2026
≈ 65%
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
in corpus
≈ 65%
A Mathematical Framework for Transformer Circuits
in corpus
2021
≈ 63%
Denotational Design: from meanings to programs
in corpus
2015
≈ 63%
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
in corpus
2023
≈ 63%
Finger Exercises in Formal Concept Analysis
in corpus
2006
≈ 63%
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
in corpus
2023
≈ 63%
Addressing divergent representations from causal interventions on neural networks
in corpus
2025
≈ 63%
Verbalized Eval Awareness Inflates Measured Safety
in corpus
2026
≈ 63%
Model Alignment Search
in corpus
2025
≈ 63%

Similar preprints — Semantic Scholar

Cited by (1)

Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts
Llama-3.1-8B solves cyclic arithmetic (e.g., "what month is six months after August?") not by performing modular addition in the period of the cyclic concept (12 for months, 7 for days of the week) as