Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts

BySheridan Feucht·Tal Haklay·Usha Bhalla·Daniel Wurgaft·Can Rager·Raphaël Sarfati ⓘ+6 moreGoodfire, Harvard University + 2 more

DOI 10.48550/arxiv.2605.01148 arXiv 2605.01148 OpenAlex W7160247613

Christopher Alexander's 15 properties Geometric structure in neural representations Neural Geometry Activation space Causal Intervention Addition module Arithmetic Base-10 Addition Mechanism in Llama Circular Geometric Representation Cyclic concepts (months, days)Layer 18 Llama-3.1 8B Modular Arithmetic Parallel modular arithmetic

TL;DR

Llama-3.1-8B solves cyclic arithmetic (e.g., "what month is six months after August?") not by performing modular addition in the period of the cyclic concept (12 for months, 7 for days of the week) as its circularly structured representations might suggest, but by reusing a single generic base-10 addition mechanism across all such tasks. The mechanism operates in two stages: first, the model computes the raw integer sum of its inputs (six + August = 14), then maps that sum back into cyclic concept space (14 → February). The paper introduces a combined causal-abstraction and feature-geometry analysis to demonstrate that Llama-3.1-8B uses task-agnostic Fourier features with periods characteristic of base-10 arithmetic—specifically periods 2, 5, and 10—rather than the concept-native period. Strikingly, a sparse set of only 28 MLP neurons at layer 18 (approximately 0.2% of that layer's MLP) is responsible for this computation across all cyclic tasks and can be partitioned into disjoint clusters, each computing the sum for a Fourier feature of a distinct period. This implies that structured representational geometry does not entail correspondingly structured computation: a model can build domain-specific circular embeddings and yet delegate the actual arithmetic to a domain-general, base-10 calculator reused opportunistically, suggesting that mechanistic interpretability analyses that read off algorithms from representation geometry alone will systematically mislead.

What to take away

1. Llama-3.1-8B represents cyclic concepts (months, days, hours) in circularly structured embedding space yet computes cyclic offsets via base-10 addition rather than modular arithmetic in the concept's native period.
2. The two-stage mechanism first produces the raw integer sum of the two inputs (e.g., six + August = 14) and then applies a learned look-up from that sum back into the cyclic concept space (14 → February).
3. The Fourier features active during cyclic-offset computation in Llama-3.1-8B have periods 2, 5, and 10—standard base-10 arithmetic periods—not the concept-specific period such as 12 for months or 7 for days.
4. Only 28 MLP neurons at layer 18, constituting approximately 0.2% of that layer's MLP capacity, are responsible for this addition computation and are reused across all tested cyclic-concept tasks.
5. Those 28 neurons can be partitioned into disjoint clusters such that each cluster computes the partial sum corresponding to a single Fourier feature period, indicating modular functional specialization within the sparse set.
6. The methodology pairs distributed interchange intervention (a causal-abstraction tool) with Fourier analysis of residual-stream activations to jointly characterize both what is computed and where in the network it occurs—a combination another researcher could replicate on any arithmetic-over-symbols task in an open-weight transformer.
7. The finding that task-agnostic addition circuitry is reused across semantically distinct cyclic domains (calendar months, clock hours, weekdays) suggests Llama-3.1-8B learned a single general arithmetic primitive rather than per-domain arithmetic modules.
8. The paper raises the open question of whether this base-10 reuse reflects a statistical regularity in pretraining text (where cyclic quantities are almost always written in decimal notation) or a more fundamental inductive bias of transformer architectures toward base-10 computation.
9. The dissociation between representational geometry (circular, period-12 for months) and computational mechanism (base-10 addition) constitutes a direct empirical counterexample to the assumption that representation structure predicts algorithmic structure in language models.
10. The paper predicts that other LLMs trained on similarly decimal-dominated corpora will exhibit the same base-10 addition reuse for cyclic concepts, making the 28-neuron MLP cluster pattern a falsifiable substrate-level signature to test in models beyond Llama-3.1-8B.

Peer brief — for seminar discussion

This paper investigates how Llama-3.1-8B computes answers to cyclic-offset queries—questions like "what month is six months after August?"—using a methodology that combines causal abstraction (distributed interchange interventions) with Fourier analysis of residual-stream activations, an approach the authors call causal-abstraction-with-feature-geometry analysis. An alternative they could have used is activation patching alone, but that would not have revealed the periodic structure of the active features. The load-bearing finding is that despite the model having circularly structured internal representations for cyclic concepts (a geometry consistent with modular arithmetic in period 12 for months, 7 for days, etc.), it does not perform modular addition in those concept-native periods. Instead, it executes a generic two-stage computation: sum the two integer inputs in base-10 (six + August → 14), then map the resulting integer back into concept space (14 → February). The Fourier features driving that summation have periods 2, 5, and 10—canonical base-10 periods—not the period of the cyclic concept being reasoned about. This computation is implemented by just 28 MLP neurons at layer 18, roughly 0.2% of that layer's MLP, which are reused across every cyclic-concept task tested and cluster into disjoint groups each responsible for a single Fourier-period partial sum. The implication the paper argues for is substantive: representational geometry is not a reliable guide to computational mechanism. A researcher who reads the circular month embeddings and concludes the model does mod-12 arithmetic would be wrong. The base-10 reuse hypothesis also carries a prediction: other large language models trained on decimal-heavy corpora should exhibit the same base-10 addition signature for cyclic concepts, making this a testable cross-model claim. The most contestable element is scope. All cyclic tasks in the study are ones where the relevant quantities are routinely expressed as decimal integers in text (month numbers, hour numbers, weekday indices), so it is unclear whether the base-10 mechanism reflects a principled architectural bias or simply the overwhelming frequency of decimal notation in pretraining data; a critic would push back that testing on a cyclic domain where the period is not a round decimal number—say, the 360 degrees of a compass or the 52 weeks of a year expressed only as ordinal words—would be necessary to distinguish these accounts. Additionally, the analysis is confined to a single model, Llama-3.1-8B, so the generality claim rests entirely on the stated prediction rather than on cross-model evidence presented in the paper itself.

Methods (1)

Causal Intervention
Technique used to confirm that circular structures are real computational objects, not merely visualization artifacts.

Findings (8)

Llama-3.1-8B implements a two-stage algorithm: (1) compute integer sum via base-10 addition (e.g., six + August = 14), then (2) map sum to cyclic concept space (14 → February)
The complete mechanistic algorithm discovered for cyclic concept reasoning
Llama-3.1-8B uses task-agnostic Fourier features with periods 2, 5, and 10 (base-10) rather than concept-specific periods (e.g., 12 for months)
The specific Fourier feature periods identified confirm base-10 rather than modular computation
The 28 MLP neurons at layer 18 can be partitioned into disjoint clusters each computing the sum for a Fourier feature with a different period
Structural finding showing modular organization within the sparse neuron set
Llama-3.1-8B reuses a single generic addition mechanism across all cyclic tasks independently of concept-specific geometry
Key mechanistic finding showing task-agnostic reuse of arithmetic circuitry
Fourier features with period 10 contribute to base-10 sum computation in the 28-neuron cluster
One of the three base-10 Fourier periods identified in the sparse neuron set
Llama-3.1-8B uses base-10 addition rather than modular addition to compute cyclic concept sums
The central empirical finding that computation does not mirror the circular representational structure
A sparse set of 28 MLP neurons at layer 18 (~0.2% of MLP) are reused across all cyclic tasks
Quantitative finding identifying the specific neurons responsible for generic addition
Llama-3.1-8B representations for cyclic concepts are circularly structured
The representation geometry finding that motivates the question about whether computation mirrors it

Claims (5)

Language models prefer reusing generic arithmetic mechanisms over learning task-specific modular computations even when task-specific geometry exists
Broader interpretive claim about LM learning bias inferred from the findings
Approximately 0.2% of MLP neurons at layer 18 (~28 neurons) are sufficient to account for the generic addition computation across all cyclic tasks
Claim about the sparsity and sufficiency of the identified neuron set
The Fourier feature periods (2, 5, 10) respect standard base-10 addition structure rather than cyclic concept periodicity
Mechanistic claim linking identified Fourier features to base-10 arithmetic
Circular representational structure does not imply that the model computes using the corresponding modular arithmetic
The paper's primary interpretive claim, directly answering the central question
An interplay between causal abstraction and feature geometry deepens mechanistic understanding of language models
Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis

Hypotheses (1)

We hypothesize that Llama-3.1-8B deploys the same base-10 addition circuitry for cyclic reasoning as it uses for general arithmetic, independent of the concept domain
Predictive hypothesis about domain-generality of the identified mechanism

Questions (3)

Does Llama compute modular addition or base-10 addition for cyclic tasks?
The specific computational question the paper resolves empirically
Does structure in representations imply structure in computation?
The central research question motivating the paper
How does Llama-3.1-8B reason over cyclic concepts?
The empirical question the paper addresses through mechanistic investigation

Original abstract (expand)

Does structure in representations imply structure in computation? We study how Llama-3.1-8B reasons over cyclic concepts (e.g., "what month is six months after August?"). Even though Llama-3.1-8B's representations for these concepts are circularly structured, we find that instead of directly computing modular addition in the period of the cyclic concept (e.g., 12 for months), the model re-uses a generic addition mechanism across tasks that operates independently of concept-specific geometry. First, it computes the sum of its two inputs using base-10 addition (six + August=14). Then, it maps this sum back to cyclic concept space (14->February). We show that Llama-3.1-8B uses task-agnostic Fourier features to compute these sums--in fact, these features have periods that respect standard base-10 addition, e.g., 2, 5, and 10, rather than the cyclic concept period (e.g., 12 for months). Furthermore, we identify a sparse set of 28 MLP neurons re-used across all tasks (approximately 0.2% of the MLP at layer 18) that can be partitioned into disjoint clusters, each computing the sum for a Fourier feature with a different period. Our work highlights how an interplay between causal abstraction and feature geometry can deepen our mechanistic understanding of LMs.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
cited
in corpus
2023
≈ 80%
Not all language model features are one-dimensionally linear
cited
2024
≈ 81%
On the Mirage of Long-Range Dependency, with an Application to Integer Multiplication
Zichao Wei
2026
≈ 80%
Successor Heads: Recurring, Interpretable Attention Heads In The Wild
Euan Ong, George Ogden, Arthur Conmy Rhys Gould
2023
≈ 79%
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models
Philip Torr, Fazl Barez Michael Lan
2024
≈ 79%
Why can't transformers learn multiplication? reverse-engineering reveals long-range dependency pitfalls
cited
2025
≈ 79%
StreetMath: Study of LLMs' Approximation Behaviors
Somshubhra Roy, Maisha Thasin, Danyang Zhang, and Blessing Effiong Chiung-Yi Tseng
2025
≈ 79%
Mechanistic Interpretability of Large-Scale Counting in LLMs through a System-2 Strategy
Mohammadali Banayeeanzade, Ali Nafisi, Sadegh Mohammadian, Fatemeh Askari, Mobin Bagherian, Amirmohammad Izadi, Mahdieh Soleymani Baghshah Hosein Hasani
2026
≈ 78%
Understanding Counting Mechanisms in Large Language and Vision-Language Models
Amirmohammad Izadi, Fatemeh Askari, Mobin Bagherian, Sadegh Mohammadian, Mohammad Izadi, Mahdieh Soleymani Baghshah Hosein Hasani
2026
≈ 78%
Where Bits Matter in World Model Planning: A Paired Mixed-Bit Study for Efficient Spatial Reasoning
Anish Patnaik, Vaishak Menon Suraj Ranganath
2026
≈ 78%
Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models
Zhiyu Xue, Yihao Quan, Benlin Liu, Zeru Shi, Michelle Hurst, Jacob Feldman, Ruixiang Tang, Ranjay Krishna, Vladimir Pavlovic Liwei Che
2026
≈ 78%
Identity as Attractor: Geometric Evidence for Persistent Agent Architecture in LLM Activation Space
Vladimir Vasilenko
2026
≈ 77%
Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs
Meng Cao, Marc-Antoine Rondeau, Jackie Chi Kit Cheung Ziling Cheng
2025
≈ 77%
What do Language Models Learn and When? The Implicit Curriculum Hypothesis
Kaiser Sun, Millicent Li, Isabelle Lee, Lindia Tjuatja, Jen-tse Huang, Graham Neubig Emmy Liu
2026
≈ 77%
LLMs Know More About Numbers than They Can Say
Li Du, Jason Eisner Fengting Yuchi
2026
≈ 77%
A geometric notion of causal probing
cited
2023
≈ 77%
A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning
Nishanth Dikkala, Enming Luo, Cyrus Rashtchian, Xin Wang, Rina Panigrahy Guan Zhe Hong
2025
≈ 77%
Understanding Addition and Subtraction in Transformers
Clement Neo, Fazl Barez Philip Quirke
2025
≈ 77%
Beyond Language: Format-Agnostic Reasoning Subspaces in Large Language Models
Zhiyuan Su Aojie Yuan
2026
≈ 77%
Linear algebraic structure of word senses, with applications to polysemy
cited
2018
≈ 76%
Detecting the Disturbance: A Nuanced View of Introspective Abilities in LLMs
in corpus
2025
≈ 76%
Discovering variable binding circuitry with desiderata
cited
2023
≈ 76%
Emergent symbol-like number variables in artificial neural networks
cited
2025
≈ 76%
Testing the Limits of Truth Directions in LLMs
in corpus
2026
≈ 76%
Linda in context
in corpus
1989
≈ 75%
Finger Exercises in Formal Concept Analysis
in corpus
2006
≈ 75%
Language models encode numbers using digit representations in base 10
cited
2025
≈ 75%
Covariance-based Sequence Pooling
in corpus
2026
≈ 75%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 75%
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
in corpus
≈ 74%

+22 more