finding
active
finding:llm-alignment-to-dinov2-vision-model-shows-a-linear-relationship-with-hellaswag-commonsense-reasoning-performance

LLM alignment to DINOv2 vision model shows a linear relationship with HellaSwag (commonsense reasoning) performance

Supports claim that cross-modal alignment predicts downstream language task performance

Source paper

extracted_from
The Platonic Representation Hypothesis
(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.