paper:feucht-goodfire-geometric-calculator-2026Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts
TL;DR
Llama-3.1-8B solves cyclic arithmetic (e.g., "what month is six months after August?") not by performing modular addition in the period of the cyclic concept (12 for months, 7 for days of the week) as its circularly structured representations might suggest, but by reusing a single generic base-10 addition mechanism across all such tasks. The mechanism operates in two stages: first, the model computes the raw integer sum of its inputs (six + August = 14), then maps that sum back into cyclic concept space (14 → February). The paper introduces a combined causal-abstraction and feature-geometry analysis to demonstrate that Llama-3.1-8B uses task-agnostic Fourier features with periods characteristic of base-10 arithmetic—specifically periods 2, 5, and 10—rather than the concept-native period. Strikingly, a sparse set of only 28 MLP neurons at layer 18 (approximately 0.2% of that layer's MLP) is responsible for this computation across all cyclic tasks and can be partitioned into disjoint clusters, each computing the sum for a Fourier feature of a distinct period. This implies that structured representational geometry does not entail correspondingly structured computation: a model can build domain-specific circular embeddings and yet delegate the actual arithmetic to a domain-general, base-10 calculator reused opportunistically, suggesting that mechanistic interpretability analyses that read off algorithms from representation geometry alone will systematically mislead.
What to take away
- 1. Llama-3.1-8B represents cyclic concepts (months, days, hours) in circularly structured embedding space yet computes cyclic offsets via base-10 addition rather than modular arithmetic in the concept's native period.
- 2. The two-stage mechanism first produces the raw integer sum of the two inputs (e.g., six + August = 14) and then applies a learned look-up from that sum back into the cyclic concept space (14 → February).
- 3. The Fourier features active during cyclic-offset computation in Llama-3.1-8B have periods 2, 5, and 10—standard base-10 arithmetic periods—not the concept-specific period such as 12 for months or 7 for days.
- 4. Only 28 MLP neurons at layer 18, constituting approximately 0.2% of that layer's MLP capacity, are responsible for this addition computation and are reused across all tested cyclic-concept tasks.
- 5. Those 28 neurons can be partitioned into disjoint clusters such that each cluster computes the partial sum corresponding to a single Fourier feature period, indicating modular functional specialization within the sparse set.
- 6. The methodology pairs distributed interchange intervention (a causal-abstraction tool) with Fourier analysis of residual-stream activations to jointly characterize both what is computed and where in the network it occurs—a combination another researcher could replicate on any arithmetic-over-symbols task in an open-weight transformer.
- 7. The finding that task-agnostic addition circuitry is reused across semantically distinct cyclic domains (calendar months, clock hours, weekdays) suggests Llama-3.1-8B learned a single general arithmetic primitive rather than per-domain arithmetic modules.
- 8. The paper raises the open question of whether this base-10 reuse reflects a statistical regularity in pretraining text (where cyclic quantities are almost always written in decimal notation) or a more fundamental inductive bias of transformer architectures toward base-10 computation.
- 9. The dissociation between representational geometry (circular, period-12 for months) and computational mechanism (base-10 addition) constitutes a direct empirical counterexample to the assumption that representation structure predicts algorithmic structure in language models.
- 10. The paper predicts that other LLMs trained on similarly decimal-dominated corpora will exhibit the same base-10 addition reuse for cyclic concepts, making the 28-neuron MLP cluster pattern a falsifiable substrate-level signature to test in models beyond Llama-3.1-8B.
Peer brief — for seminar discussion
This paper investigates how Llama-3.1-8B computes answers to cyclic-offset queries—questions like "what month is six months after August?"—using a methodology that combines causal abstraction (distributed interchange interventions) with Fourier analysis of residual-stream activations, an approach the authors call causal-abstraction-with-feature-geometry analysis. An alternative they could have used is activation patching alone, but that would not have revealed the periodic structure of the active features. The load-bearing finding is that despite the model having circularly structured internal representations for cyclic concepts (a geometry consistent with modular arithmetic in period 12 for months, 7 for days, etc.), it does not perform modular addition in those concept-native periods. Instead, it executes a generic two-stage computation: sum the two integer inputs in base-10 (six + August → 14), then map the resulting integer back into concept space (14 → February). The Fourier features driving that summation have periods 2, 5, and 10—canonical base-10 periods—not the period of the cyclic concept being reasoned about. This computation is implemented by just 28 MLP neurons at layer 18, roughly 0.2% of that layer's MLP, which are reused across every cyclic-concept task tested and cluster into disjoint groups each responsible for a single Fourier-period partial sum. The implication the paper argues for is substantive: representational geometry is not a reliable guide to computational mechanism. A researcher who reads the circular month embeddings and concludes the model does mod-12 arithmetic would be wrong. The base-10 reuse hypothesis also carries a prediction: other large language models trained on decimal-heavy corpora should exhibit the same base-10 addition signature for cyclic concepts, making this a testable cross-model claim. The most contestable element is scope. All cyclic tasks in the study are ones where the relevant quantities are routinely expressed as decimal integers in text (month numbers, hour numbers, weekday indices), so it is unclear whether the base-10 mechanism reflects a principled architectural bias or simply the overwhelming frequency of decimal notation in pretraining data; a critic would push back that testing on a cyclic domain where the period is not a round decimal number—say, the 360 degrees of a compass or the 52 weeks of a year expressed only as ordinal words—would be necessary to distinguish these accounts. Additionally, the analysis is confined to a single model, Llama-3.1-8B, so the generality claim rests entirely on the stated prediction rather than on cross-model evidence presented in the paper itself.
Methods (1)
- Causal InterventionTechnique used to confirm that circular structures are real computational objects, not merely visualization artifacts.
Findings (8)
- Llama-3.1-8B implements a two-stage algorithm: (1) compute integer sum via base-10 addition (e.g., six + August = 14), then (2) map sum to cyclic concept space (14 → February)
The complete mechanistic algorithm discovered for cyclic concept reasoning
- Llama-3.1-8B uses task-agnostic Fourier features with periods 2, 5, and 10 (base-10) rather than concept-specific periods (e.g., 12 for months)
The specific Fourier feature periods identified confirm base-10 rather than modular computation
- The 28 MLP neurons at layer 18 can be partitioned into disjoint clusters each computing the sum for a Fourier feature with a different period
Structural finding showing modular organization within the sparse neuron set
- Llama-3.1-8B reuses a single generic addition mechanism across all cyclic tasks independently of concept-specific geometry
Key mechanistic finding showing task-agnostic reuse of arithmetic circuitry
- Fourier features with period 10 contribute to base-10 sum computation in the 28-neuron cluster
One of the three base-10 Fourier periods identified in the sparse neuron set
- Llama-3.1-8B uses base-10 addition rather than modular addition to compute cyclic concept sums
The central empirical finding that computation does not mirror the circular representational structure
- A sparse set of 28 MLP neurons at layer 18 (~0.2% of MLP) are reused across all cyclic tasks
Quantitative finding identifying the specific neurons responsible for generic addition
- Llama-3.1-8B representations for cyclic concepts are circularly structured
The representation geometry finding that motivates the question about whether computation mirrors it
Claims (5)
- Language models prefer reusing generic arithmetic mechanisms over learning task-specific modular computations even when task-specific geometry exists
Broader interpretive claim about LM learning bias inferred from the findings
- Approximately 0.2% of MLP neurons at layer 18 (~28 neurons) are sufficient to account for the generic addition computation across all cyclic tasks
Claim about the sparsity and sufficiency of the identified neuron set
- The Fourier feature periods (2, 5, 10) respect standard base-10 addition structure rather than cyclic concept periodicity
Mechanistic claim linking identified Fourier features to base-10 arithmetic
- Circular representational structure does not imply that the model computes using the corresponding modular arithmetic
The paper's primary interpretive claim, directly answering the central question
- An interplay between causal abstraction and feature geometry deepens mechanistic understanding of language models
Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis
Hypotheses (1)
- We hypothesize that Llama-3.1-8B deploys the same base-10 addition circuitry for cyclic reasoning as it uses for general arithmetic, independent of the concept domain
Predictive hypothesis about domain-generality of the identified mechanism
Questions (3)
- Does Llama compute modular addition or base-10 addition for cyclic tasks?
The specific computational question the paper resolves empirically
- Does structure in representations imply structure in computation?
The central research question motivating the paper
- How does Llama-3.1-8B reason over cyclic concepts?
The empirical question the paper addresses through mechanistic investigation
Original abstract (expand)
Does structure in representations imply structure in computation? We study how Llama-3.1-8B reasons over cyclic concepts (e.g., "what month is six months after August?"). Even though Llama-3.1-8B's representations for these concepts are circularly structured, we find that instead of directly computing modular addition in the period of the cyclic concept (e.g., 12 for months), the model re-uses a generic addition mechanism across tasks that operates independently of concept-specific geometry. First, it computes the sum of its two inputs using base-10 addition (six + August=14). Then, it maps this sum back to cyclic concept space (14->February). We show that Llama-3.1-8B uses task-agnostic Fourier features to compute these sums--in fact, these features have periods that respect standard base-10 addition, e.g., 2, 5, and 10, rather than the cyclic concept period (e.g., 12 for months). Furthermore, we identify a sparse set of 28 MLP neurons re-used across all tasks (approximately 0.2% of the MLP at layer 18) that can be partitioned into disjoint clusters, each computing the sum for a Fourier feature with a different period. Our work highlights how an interplay between causal abstraction and feature geometry can deepen our mechanistic understanding of LMs.
Related work— refs + corpus + external arXiv
Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.
- The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasetscitedin corpus2023≈ 80%
- ≈ 81%
- ≈ 80%
- Successor Heads: Recurring, Interpretable Attention Heads In The WildEuan Ong, George Ogden, Arthur Conmy Rhys Gould2023≈ 79%
- Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language ModelsPhilip Torr, Fazl Barez Michael Lan2024≈ 79%
- ≈ 79%
- StreetMath: Study of LLMs' Approximation BehaviorsSomshubhra Roy, Maisha Thasin, Danyang Zhang, and Blessing Effiong Chiung-Yi Tseng2025≈ 79%
- Mechanistic Interpretability of Large-Scale Counting in LLMs through a System-2 StrategyMohammadali Banayeeanzade, Ali Nafisi, Sadegh Mohammadian, Fatemeh Askari, Mobin Bagherian, Amirmohammad Izadi, Mahdieh Soleymani Baghshah Hosein Hasani2026≈ 78%
- Understanding Counting Mechanisms in Large Language and Vision-Language ModelsAmirmohammad Izadi, Fatemeh Askari, Mobin Bagherian, Sadegh Mohammadian, Mohammad Izadi, Mahdieh Soleymani Baghshah Hosein Hasani2026≈ 78%
- Where Bits Matter in World Model Planning: A Paired Mixed-Bit Study for Efficient Spatial ReasoningAnish Patnaik, Vaishak Menon Suraj Ranganath2026≈ 78%
- Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language ModelsZhiyu Xue, Yihao Quan, Benlin Liu, Zeru Shi, Michelle Hurst, Jacob Feldman, Ruixiang Tang, Ranjay Krishna, Vladimir Pavlovic Liwei Che2026≈ 78%
- Identity as Attractor: Geometric Evidence for Persistent Agent Architecture in LLM Activation SpaceVladimir Vasilenko2026≈ 77%
- Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMsMeng Cao, Marc-Antoine Rondeau, Jackie Chi Kit Cheung Ziling Cheng2025≈ 77%
- What do Language Models Learn and When? The Implicit Curriculum HypothesisKaiser Sun, Millicent Li, Isabelle Lee, Lindia Tjuatja, Jen-tse Huang, Graham Neubig Emmy Liu2026≈ 77%
- ≈ 77%
- A geometric notion of causal probingcited2023≈ 77%
- A Implies B: Circuit Analysis in LLMs for Propositional Logical ReasoningNishanth Dikkala, Enming Luo, Cyrus Rashtchian, Xin Wang, Rina Panigrahy Guan Zhe Hong2025≈ 77%
- ≈ 77%
- Beyond Language: Format-Agnostic Reasoning Subspaces in Large Language ModelsZhiyuan Su Aojie Yuan2026≈ 77%
- ≈ 76%
- ≈ 76%
- ≈ 76%
- ≈ 76%
- Testing the Limits of Truth Directions in LLMsin corpus2026≈ 76%
- Linda in contextin corpus1989≈ 75%
- Finger Exercises in Formal Concept Analysisin corpus2006≈ 75%
- ≈ 75%
- Covariance-based Sequence Poolingin corpus2026≈ 75%
- Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencodersin corpus2026≈ 75%
- ≈ 74%
+22 more