dataset
pending-review
dataset:humanity-s-last-examHumanity's Last Exam
nguyen-2025-sfr-deepresearch.mdFrontmatter (9 fields)
{
"doc": "nguyen-2025-sfr-deepresearch.md",
"context": "Reasoning-focused benchmark covering math and science domains; SFR-DR-20B achieves 28.7% on the full text-only subset.",
"norm_label": "Humanity's Last Exam",
"graphify_id": "humanity_last_exam",
"source_file": "nguyen-2025-sfr-deepresearch.md",
"imported_from": "/Users/antonborzov/Documents/Research.nosync/papers/extract_typed_out/nguyen-2025-sfr-deepresearch/graph.json",
"extracted_type": "dataset",
"source_location": "§4.1",
"graphify_file_type": "dataset"
}Outgoing (0)
None.
Incoming (1)
mentions (1)
- SFR-DeepResearch(framework)
Mentions (1)
- papers-typed
nguyen-2025-sfr-deepresearch.md