thinker:orcid-0000-0002-4483-5005Patrick E. Duffy
Authored papers (2)
Evee, a variant-pathogenicity platform built on the Evo 2 genomic foundation model, achieves 0.997 AUROC on a 839,000-variant ClinVar benchmark — outperforming all previously reported methods — while simultaneously generating mechanistic "disruption profiles" that explain *why* a variant is predicted pathogenic rather than producing a scalar score alone. Zero-shot performance on insertions and deletions reaches 0.991 AUROC, a regime where many supervised methods degrade sharply. The system scales to approximately 4.2 million variants in total, including roughly 2 million variants of uncertain significance (VUS) for which no ground-truth labels exist and for which the disruption profiles constitute the primary clinical output. In a structured human evaluation, disruption profiles scored 3.8/5 for explanation quality against 2.8/5 for metadata-only baselines — a 36% relative gain in perceived explanatory value. The method extracts these profiles from Evo 2's internal representations via a sparse-feature-attribution pipeline developed jointly by Goodfire and Mayo Clinic. The paper argues this demonstrates that mechanistic interpretability applied to large biological sequence models can close the gap between black-box accuracy and clinician-usable reasoning, making genome-wide functional annotation tractable for direct clinical deployment on VUS that currently stall diagnostic workflows.
More papers — OpenAlex / S2
Affiliations (2)
- Mayo Clinic(concept)
- Goodfire(institute)
Co-authors (12)
- Alexander J. Ryu2 shared
- Archa Jain2 shared
- Bridget Toomey2 shared
- Carl Molnar2 shared
- Ching Fang2 shared
- Collin Osborne2 shared
- Daniel Balsam2 shared
- Dron Hazra2 shared
- Elena Myasoedova2 shared
- Eric W. Klee2 shared
- Joshua Meehl2 shared
- Mark Bissell2 shared
Recent mentions (1)
- papers-typedpearce-goodfire-evee-genetic-variants-2026.md