finding
active
finding:covariance-pooling-compresses-gigabytes-of-activations-into-compact-stable-embeddings-without-large-labeled-datasetsCovariance pooling compresses gigabytes of activations into compact stable embeddings without large labeled datasets
Practical finding: the method produces compact fixed-length representations from large volumes of token activations without requiring supervised labels.
Source paper
extracted_from(2026) · Dooms, Thomas · Wang, Nicholas K. · Pearce, Michael T.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Authors' suggestion that the second-moment preservation principle applies broadly, not just to genomic foundation models.
Communities (3)
community
- Explores geometry of activation/behavior manifolds to enable selective, non-destructive concept interventions.
- Using second-order statistics to compress activation patterns while preserving feature co-occurrence structure, tested on genomic prediction tasks without large labeled datasets.
- Replaces mean pooling with second-order statistics, achieving large R² and AUC gains on genomic tasks.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Specific interpretive claim about what covariance pooling captures: the pairwise co-activation patterns across features that are invisible to mean pooling.
- Novel aggregation technique replacing mean pooling; preserves joint activation structure (feature co-occurrence) in token embeddings.
- Open question implied by the claim that the method could generalize; empirical validation beyond genomics is not provided in this paper.
- Covariance pooling achieves +52.9% R² improvement over mean pooling on Genomic Track Prediction.finding0.785Primary empirical result demonstrating practical utility of covariance pooling method.
- Author's conclusion after extensive investigation of architectural approaches to monosemanticity
- Key geometry-to-behavior bridge finding in E3; robust to pooling choice, cosine vs. L2, and frozen external encoder
- Analytical result showing exponential power activation allows memory storage scaling as 2^(N/2); cited in context of Hopfield scaling.
- Visual geometric evidence for the fundamental entanglement of true/false activations in harder tasks.