claim
active
claim:disruption-profiles-are-higher-quality-explanations-than-metadata-only-descriptionsDisruption profiles are higher quality explanations than metadata-only descriptions
Claim supported by the 3.8 vs 2.8 human rating finding.
Source paper
extracted_from(2026) · Pearce, Michael · Dooms, Thomas · Yamamoto, Ryo · Meehl, Joshua +18
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (1)
finding
- Disruption profiles scored 3.8/5 for explanation quality vs 2.8/5 for metadata-only baselinessupportsEVEE's mechanistic explanations significantly outperform simple metadata-based predictions in human evaluation.
Communities (3)
community
- Spans attention head decomposition, benchmark awareness, and genomic pathogenicity prediction via neural models.
- Using genomic foundation model internals to generate disruption profiles that explain variant effects mechanistically, achieving 0.997 AUROC on ClinVar pathogenicity prediction.
- Disruption profile explanation qualitymembers_ofEmpirical evaluation showing disruption profiles outperform metadata-only baselines at 3.8 vs 2.8/5
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Mechanistic explanation outputs from EVEE showing how variants affect gene function, scored 3.8/5 for explanation quality.
- The output explanation format of Evee, quantifying how a variant disrupts different genomic features.
- Extrapolation from scale-emergence finding to future risk
- Authors' central interpretive assertion that their method meaningfully mitigates unwanted behaviors.
- Methodological justification for using SDF over direct demonstrations to train a realistic model organism.
- Conceptual framing: integrates mechanistic interpretability tools with alignment-focused data curation.
- Tested in Section 4.4 calibration experiment; confirmed by findings.
- Core limitation and usage heuristic: read NLAs for themes rather than individual factual claims; cross-check with original context.