concept
active
concept:aurocAUROC
Performance metric for binary classification; used to evaluate pathogenicity prediction.
Neighborhood — ranked by edge-count
Papers (2)
paper
Methods (1)
method
- Cross-task generalization evaluationimplementsMeasuring AUROC of a probe trained on one task when evaluated on another task to assess universality.
Related by similarity (3)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Automated benchmarking framework for evaluating LLM meta-cognition, mentioned as related work.
- Lab behind Claude models and Constitutional AI training approach; represents highest baseline scores and lowest prompt lift.
- Classification-based comparison of interpretation abilities across IIT metrics and Span Representation for ToM score categories.