Covariance-based Sequence Pooling

ByThomas Dooms·Nicholas K. Wang·Michael T. Pearce

LLM Interpretability & Behavioral Analysis LLM interpretability & self-awareness LLM Introspection EVEE / Mayo Work Unsupervised autoencoder embeddings Second moments Sequence pooling

TL;DR

Covariance pooling — replacing mean pooling with second-moment statistics over token embeddings — yields a +52.9% gain in R² on genomic track prediction and a +8.4% AUC improvement on Gene Ontology prediction when applied to genomic foundation models. The method, introduced by Dooms, Wang, and Pearce at Goodfire, computes pairwise feature co-occurrence structure across a sequence's token embeddings rather than collapsing them to a single mean vector, thereby preserving joint activation patterns that mean pooling discards by construction. These gains hold with unsupervised autoencoder embeddings, requiring no large labeled datasets — the compact covariance representations are derived from gigabytes of raw activations compressed into stable, fixed-size matrices. The method emerged as a methodological side-product of the EVEE/Mayo collaboration, suggesting its origin was empirical rather than theoretical. The deeper claim is a structural one: first moments (means) are insufficient summaries of embedding geometry whenever the discriminative signal lives in feature co-occurrence rather than marginal activation levels. The paper argues this implies mean-pooling baselines are systematically underperforming across any domain where token interaction structure is predictively relevant, making covariance pooling a candidate default for sequence-level representation in genomic and potentially other foundation-model pipelines.

What to take away

1. Covariance pooling over token embeddings from genomic foundation models achieves a +52.9% R² improvement over mean pooling on a genomic track prediction benchmark.
2. On Gene Ontology prediction, covariance pooling with unsupervised autoencoder embeddings raises AUC by +8.4% relative to the mean-pooling baseline.
3. The method requires no large labeled datasets, deriving compact stable embeddings by compressing gigabytes of raw model activations into fixed-size second-moment matrices.
4. The central methodological claim is that mean pooling discards joint activation structure (feature co-occurrence) by collapsing to first moments, while covariance pooling retains pairwise feature statistics across all sequence positions.
5. The covariance pooling method was developed as a side-product of the EVEE/Mayo genomics collaboration at Goodfire, indicating the benchmark tasks were drawn from real applied genomics workflows rather than synthetic benchmarks.
6. To replicate the core comparison, a researcher would compute the full token-by-token covariance matrix from a frozen genomic foundation model's residual stream, flatten or compress it, and train a linear probe alongside a matched mean-pooling probe on the same split.
7. The authors raise the open question of whether the second-moment advantage generalizes beyond genomics to any sequence domain where token interaction structure — rather than marginal activation levels — carries the predictive signal.
8. Goodfire's framing positions covariance pooling as a candidate default aggregation method for genomic foundation models, implicitly predicting that mean-pooling baselines in published genomics benchmarks are systematically underreported in their ceiling.
9. The unsupervised autoencoder embedding condition (yielding +8.4% AUC on Gene Ontology) is specifically notable because it demonstrates the gain does not depend on task-supervised representation learning.
10. Covariance pooling operates on the same frozen model activations as mean pooling, meaning the computational overhead is confined to the aggregation step and does not require retraining or fine-tuning the underlying genomic foundation model.

Peer brief — for seminar discussion

Dooms, Wang, and Pearce at Goodfire introduce covariance pooling, a sequence aggregation method that replaces the conventional mean pooling of token embeddings with a second-moment (covariance) summary computed across all token positions in a sequence. Applied to genomic foundation models, the method computes pairwise feature co-occurrence statistics from the full token embedding matrix, producing a compact fixed-size representation without requiring labeled data or model fine-tuning. The load-bearing finding is a +52.9% R² improvement on a genomic track prediction task and a +8.4% AUC gain on Gene Ontology prediction, both relative to mean-pooling baselines, with the Gene Ontology result obtained using unsupervised autoencoder embeddings. These are not marginal gains: a 52.9% R² lift suggests mean pooling is severely misspecified for this representational regime, not merely suboptimal. The paper argues the mechanism is structural — mean pooling is a first-moment statistic and is lossless only when the downstream signal is linear in marginal activations; when discriminative information lives in feature co-activation patterns (which is likely in genomic sequences where regulatory motif combinations matter), the mean discards the signal by construction. An alternative aggregation strategy the work does not benchmark against is attention-weighted pooling or CLS-token projection, which is a notable omission since those methods also attempt to preserve relational structure across positions, though at higher parametric cost. The method originated as a methodological byproduct of the EVEE/Mayo collaboration, and the two benchmark tasks — genomic track prediction and Gene Ontology classification — reflect that applied context. The broader hypothesis, stated explicitly, is that the second-moment advantage should generalize beyond genomics to any domain where token interaction geometry is predictively relevant. A critical reader would push back on external validity: both benchmark tasks are from a single application domain (genomics) and likely share architectural priors — the genomic foundation models used are not named in the available summary, making it impossible to assess whether the gains are model-specific (e.g., tied to a particular tokenization scheme or embedding dimensionality that inflates covariance signal). Until covariance pooling is evaluated on at least one non-genomic sequence model (e.g., a protein language model like ESM-2 650M or a text transformer), the generalization claim rests on theoretical argument rather than evidence. The compression from gigabytes of activations to stable matrices is also underspecified — the exact rank reduction or autoencoder architecture used in the unsupervised condition would materially affect reproducibility.

Methods (1)

Unsupervised autoencoder embeddings
Method used alongside covariance pooling for the Gene Ontology prediction task; produces embeddings without large labeled datasets.

Findings (3)

Covariance pooling compresses gigabytes of activations into compact stable embeddings without large labeled datasets
Practical finding: the method produces compact fixed-length representations from large volumes of token activations without requiring supervised labels.
Gene Ontology prediction: +8.4% AUC improvement with unsupervised autoencoder and covariance pooling embeddings
Empirical result: covariance pooling combined with unsupervised autoencoder embeddings improves Gene Ontology prediction AUC by 8.4% over mean pooling.
Covariance pooling achieves +52.9% R² improvement over mean pooling on Genomic Track Prediction.
Primary empirical result demonstrating practical utility of covariance pooling method.

Claims (4)

Covariance pooling preserves joint activation structure (feature co-occurrence) that mean pooling discards
Specific interpretive claim about what covariance pooling captures: the pairwise co-activation patterns across features that are invisible to mean pooling.
Covariance pooling could generalize beyond genomics as a general-purpose replacement for mean pooling
Authors' suggestion that the second-moment preservation principle applies broadly, not just to genomic foundation models.
Second moments preserve structure that first moments destroy.
Core interpretive claim generalizing beyond genomics; argues mean pooling discards information present in covariance.
Geometry of features matters for representation quality.
General principle supported tangentially by covariance pooling work; relates to feature co-occurrence structure.

Questions (1)

Can covariance pooling generalize beyond genomics to other domains?
Open question implied by the claim that the method could generalize; empirical validation beyond genomics is not provided in this paper.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Entropy, Disagreement, and the Limits of Foundation Models in Genomics
Lovro Vr\v{c}ek, Mile \v{S}iki\'c Maxime Rochkoulets
2026
≈ 83%
From Token Lists to Graph Motifs: Weisfeiler-Lehman Analysis of Sparse Autoencoder Features
Pablo Magari\~nos-Docampo, Javier Perez-Robles Ruben Fernandez-Boullon
2026
≈ 80%
Noether Networks: Meta-Learning Useful Conserved Quantities
Dylan Doblar, Allan Zhou, Joshua Tenenbaum, Kenji Kawaguchi, Chelsea Finn Ferran Alet
2021
≈ 80%
CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features
Zekun Wu, Adriano Koshiyama Seonglae Cho
2026
≈ 80%
Semantic Convergence: Investigating Shared Representations Across Scaled LLMs
Sanjana Rathore, Andrew Rufail, Adrian Simon, Daniel Zhang, Soham Dave, Cole Blondin, Kevin Zhu, Sean O'Brien Daniel Son
2025
≈ 80%
What Cohort INRs Encode and Where to Freeze Them
Sophie Starck, Robbie Holland, Julian McGinnis, Daniel Rueckert Vasiliki Sideri-Lampretsa
2026
≈ 79%
Mechanistic Decomposition of Sentence Representations
Vikram Natarajan, Jonathan Michala, Milton Lin, Juri Opitz Matthieu Tehenan
2025
≈ 79%
From Where Words Come: Efficient Regularization of Code Tokenizers Through Source Attribution
Pavel Chizhov and Egor Bogomolov and Ivan P. Yamshchikov
2026
≈ 79%
Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Zhen Tan, Song Wang, Kaidi Xu, Tianlong Chen Zhen Xu
2025
≈ 79%
Interpreting token compositionality in LLMs: A robustness analysis
Danilo S. Carvalho, Andr\'e Freitas Nura Aljaafari
2025
≈ 79%
Unveiling interpretable development-specific gene signatures in the developing human prefrontal cortex with ICGS
Xiucai Ye (1 and 2), Tetsuya Sakurai (1 and 2) ((1) University of Tsukuba, (2) Center for Artificial Intelligence Research in University of Tsukuba) Meng Huang (1)
2022
≈ 79%
Exemplar Partitioning for Mechanistic Interpretability
Jessica Rumbelow
2026
≈ 79%
Sparse Autoencoder Decomposition of Clinical Sequence Model Representations: Feature Complexity, Task Specialisation, and Mortality Prediction
Feng Dong, Andreas Karwath Chris Sainsbury
2026
≈ 79%
From Data Statistics to Feature Geometry: How Correlations Shape Superposition
Edward Stevinson, Melih Barsbey, Tolga Birdal, Pedro A.M. Mediano Lucas Prieto
2026
≈ 79%
A Unified Theory of Sparse Dictionary Learning in Mechanistic Interpretability: Piecewise Biconvexity and Spurious Minima
Harshvardhan Saini, Zhaoqian Yao, Zheng Lin, Yizhen Liao, Jingyi Cui, Yisen Wang, Mengnan Du, Dianbo Liu Yiming Tang
2026
≈ 79%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 78%
Addressing divergent representations from causal interventions on neural networks
in corpus
2025
≈ 77%
Explaining 4.2 million genetic variants with state-of-the-art, interpretable predictions
in corpus
2026
≈ 77%
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
in corpus
2026
≈ 77%
Persistence and Introspection of Emotion Features
in corpus
≈ 76%
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
in corpus
2023
≈ 76%
Darwin's agential materials: evolutionary implications of multiscale competency in developmental biology
in corpus
2023
≈ 76%
The Platonic Representation Hypothesis
in corpus
2024
≈ 76%
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
in corpus
2023
≈ 76%
A Mathematical Framework for Transformer Circuits
in corpus
2021
≈ 76%
Model Alignment Search
in corpus
2025
≈ 76%
The Guanyin Protocol: A Framework for Immediately Establishing an Understanding of Both Causality and Compassion in LLM Systems Using Semantic Anchoring
in corpus
2025
≈ 75%