EVEE: Interpretable variant effect prediction from genomic foundation model embeddings

By Michael T Pearce · Thomas Dooms · Ryō Yamamoto · Joshua Meehl · Carl Molnar · Mark Bissell · +16 more
Goodfire, Radical AI

DOI 10.64898/2026.04.10.717844 OpenAlex W7153570004


Abstract

Predicting the clinical significance of genetic variants remains a central challenge in genomic medicine, with most observed variants classified as variants of uncertain significance. Here we show that representations from Evo 2, a 7-billion-parameter genomic foundation model, support accurate and interpretable pathogenicity prediction across variant types from a single framework. An embedding-based classifier, or “probe”, trained on Evo 2 embeddings achieves state-of-the-art performance across single nucleotide variant consequence types (0.997 overall AUROC on 833k ClinVar variants) and generalizes zero-shot to indels (0.991 AUROC), outperforming bioinformatic meta-predictors, protein models, and existing foundation model approaches. Performance is robust across conservation levels and transfers to deep mutational scanning datasets for BRCA1, BRCA2, TP53, and LDLR. To make these predictions interpretable, we train supervised annotation probes to quantify predicted disruptions caused by each variant, then synthesize these disruption profiles into natural language explanations using a frontier reasoning model. We provide pre-computed predictions and on-demand explanations for all 4.2 million ClinVar variants through the Evo Variant Effect Explorer (EVEE), an interactive web resource for the community. This work establishes that representations from genomic foundation models can serve as a unified substrate for both accurate variant effect prediction and mechanistic interpretation, reframing interpretability in computational genomics from a trade-off into a complementary product of learned biological structure.
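To make the idea of an embedding-based probe scored by AUROC concrete, here is a minimal sketch. It is not the authors' pipeline: the abstract does not specify the probe architecture, so a plain logistic-regression probe is assumed, and random synthetic vectors stand in for Evo 2 embeddings and ClinVar labels. All function names (`train_logistic_probe`, `auroc`, `run_demo`) are hypothetical.

```python
import numpy as np


def train_logistic_probe(X, y, lr=0.1, epochs=500, l2=1e-3):
    """Fit a logistic-regression probe with batch gradient descent.

    ASSUMPTION: a linear probe; the abstract does not state the probe type.
    """
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # predicted P(pathogenic)
        w -= lr * (X.T @ (p - y) / n + l2 * w)   # gradient step with L2 penalty
        b -= lr * np.mean(p - y)
    return w, b


def auroc(scores, labels):
    """AUROC via the Mann-Whitney U statistic; tied scores get average ranks."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels).astype(bool)
    order = np.argsort(scores, kind="mergesort")
    sorted_scores = scores[order]
    ranks = np.empty(len(scores))
    i = 0
    while i < len(scores):
        j = i
        while j + 1 < len(scores) and sorted_scores[j + 1] == sorted_scores[i]:
            j += 1
        ranks[order[i:j + 1]] = (i + j) / 2.0 + 1.0  # average 1-based rank
        i = j + 1
    n_pos = labels.sum()
    n_neg = len(labels) - n_pos
    u = ranks[labels].sum() - n_pos * (n_pos + 1) / 2.0
    return u / (n_pos * n_neg)


def run_demo(seed=0, n=2000, d=32):
    """Train on synthetic 'embeddings' and return held-out AUROC."""
    rng = np.random.default_rng(seed)
    y = rng.integers(0, 2, size=n).astype(float)  # 1 = pathogenic (synthetic)
    direction = rng.normal(size=d)                # hypothetical signal direction
    # SYNTHETIC DATA: Gaussian noise plus a label-dependent shift.
    X = rng.normal(size=(n, d)) + 0.5 * np.outer(y - 0.5, direction)
    split = n // 2
    w, b = train_logistic_probe(X[:split], y[:split])
    return auroc(X[split:] @ w + b, y[split:])


if __name__ == "__main__":
    print(f"held-out AUROC on synthetic data: {run_demo():.3f}")
```

In this sketch the model that produces the embeddings is never updated; only the small probe is trained, which is the general pattern the abstract describes. The AUROC printed here reflects the synthetic data only and says nothing about the reported 0.997 figure.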

Related work: refs + corpus + external arXiv

The cited, in-corpus, and arXiv badges show which signals surfaced each row. Rows surfaced by multiple sources are weighted higher.

Explaining 4.2 million genetic variants with state-of-the-art, interpretable predictions
in corpus
2026
≈ 93%
Unveiling interpretable development-specific gene signatures in the developing human prefrontal cortex with ICGS
Xiucai Ye (1 and 2), Tetsuya Sakurai (1 and 2) ((1) University of Tsukuba, (2) Center for Artificial Intelligence Research in University of Tsukuba) Meng Huang (1)
2022
≈ 81%
EVA: Towards a universal model of the immune system
Vincent Bouget, Apolline Bruley, Yannis Cattan, Charlotte Claye, Matthew Corney, Julien Duquesne, Karim El Kanbi, Aziz Fouché, Pierre Marschall, Francesco Strozzi Scienta Team: Ethan Bandasack
2026
≈ 81%
Entropy, Disagreement, and the Limits of Foundation Models in Genomics
Lovro Vrček, Mile Šikić Maxime Rochkoulets
2026
≈ 80%
BIOGEN: Evidence-Grounded Multi-Agent Reasoning Framework for Transcriptomic Interpretation in Antimicrobial Resistance
Mehrdad Shoeibi, Ivan Garibay and Niloofar Yousefi Elias Hossain
2026
≈ 80%
GenoBERT: A Language Model for Accurate Genotype Imputation
Chuan Qiu, Kuan-Jui Su, Anqi Liu, Yun Gong, Weiqiang Lin, Lindong Jiang, Chen Zhao, Meng Song, Jeffrey Deng, Qing Tian, Zhe Luo, Ping Gong, Hui Shen, Chaoyang Zhang, and Hong-Wen Deng Lei Huang
2026
≈ 80%
Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2
Paulo Yanez Sarmiento, Bernhard Y. Renard Isabel Kurth
2026
≈ 80%
Learning biologically relevant features in a pathology foundation model using sparse autoencoders
Ciyue Shen, Neel Patel, Chintan Shah, Darpan Sanghavi, Blake Martin, Alfred Eng, Daniel Shenker, Harshith Padigela, Raymond Biju, Syed Ashar Javed, Jennifer Hipp, John Abel, Harsha Pokkalla, Sean Grullon, Dinkar Juyal Nhat Minh Le
2024
≈ 79%
Discovery of Disease Relationships via Transcriptomic Signature Analysis Powered by Agentic AI
Ke Chen and Haohan Wang
2025
≈ 79%
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
in corpus
2026
≈ 79%
Ultrafast topological data analysis reveals pandemic-scale dynamics of convergent evolution
Lukas Hahn, Maximilian Neumann, Zachary Ardern, Juan Angel Patino-Galindo, Mathieu Carriere, Ulrich Bauer, Raul Rabadan, Andreas Ott Michael Bleher
2026
≈ 79%
MS-ConTab: Multi-Scale Contrastive Learning of Mutation Signatures for Pan Cancer Representation and Stratification
Adam Khadre, Ruben C Petreaca, Golrokh Mirzaei Yifan Dou
2025
≈ 78%
Covariance-based Sequence Pooling
in corpus
2026
≈ 78%
A Multi-Level Causal Intervention Framework for Mechanistic Interpretability in Variational Autoencoders
Rajiv Misra, Sanjay Kumar Singh, Anisha Roy Dip Roy
2026
≈ 78%
Contextual Invertible World Models: A Neuro-Symbolic Agentic Framework for Colorectal Cancer Drug Response
Karen Rafferty, Hui Wang Christopher Baker
2026
≈ 78%
Sparse Autoencoder Decomposition of Clinical Sequence Model Representations: Feature Complexity, Task Specialisation, and Mortality Prediction
Feng Dong, Andreas Karwath Chris Sainsbury
2026
≈ 78%
When AI Does Science: Evaluating the Autonomous AI Scientist KOSMOS in Radiation Biology
Humza Nusrat and Omar Nusrat
2025
≈ 78%
Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents
Cen Wan and Alex A. Freitas
2026
≈ 78%
Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies
in corpus
2023
≈ 76%
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
in corpus
2024
≈ 76%
Addressing divergent representations from causal interventions on neural networks
in corpus
2025
≈ 76%
Anima Labs Phenomenology Pt1
in corpus
≈ 75%
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
in corpus
2023
≈ 75%
Active Inference, Curiosity and Insight
in corpus
2017
≈ 75%
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions
in corpus
2024
≈ 75%
Paper Summary: Interpreting Language Model Parameters
in corpus
≈ 74%
Model Alignment Search
in corpus
2025
≈ 74%
Multimodal Chain-of-Thought Reasoning in Language Models
in corpus
2023
≈ 74%
