finding
active
finding:among-78-vision-models-on-places-365-models-that-solve-more-vtab-tasks-tend-to-be-more-aligned-with-each-other-with-high-performance-models-forming-a-tightly-clustered-setAmong 78 vision models on Places-365, models that solve more VTAB tasks tend to be more aligned with each other, with high-performance models forming a tightly clustered set
Empirical result showing alignment increases with model competence
Source paper
extracted_from(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola
Neighborhood — ranked by edge-count
Claims (2)
claim
- Primary empirical claim of the paper
- Author's interpretation of the VTAB alignment results echoing Tolstoy
Questions (1)
question
- What has led to representational convergence, will it continue, and ultimately where does it end?answered_byCentral motivating questions of the paper
Quotes (1)
quote
- The paper's central thesis statement, presented prominently after the abstract
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Key empirical finding establishing that representational alignment correlates with model competence
- Lenc & Vedaldi result illustrating data independence in representations and layer-wise alignment
- Claims that alignment score is a proxy for general capability
- Empirical evidence for the universality hypothesis cited as supporting the possibility of convergent consciousness-like solutions
- Empirical finding supporting the Universality Hypothesis; extended by the paper to consciousness
- CLIP training paradigm finding in cross-modal alignment
- Demonstrated CNN representations predict neurons in visual cortex; background motivation for neural-network-brain correspondence.
- Key cross-modal alignment result