finding
active
finding:a-vision-model-trained-on-imagenet-can-be-aligned-with-a-model-trained-on-places-365-while-maintaining-good-performance-and-early-layers-are-more-interchangeable-than-later-layersA vision model trained on ImageNet can be aligned with a model trained on Places-365 while maintaining good performance, and early layers are more interchangeable than later layers
Lenc & Vedaldi result illustrating data independence in representations and layer-wise alignment
Source paper
extracted_from(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- The central hypothesis of the paper; the platonic representation hypothesis itself
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- CLIP training paradigm finding in cross-modal alignment
- Empirical finding supporting the Universality Hypothesis; extended by the paper to consciousness
- Claims that alignment score is a proxy for general capability
- Empirical result showing alignment increases with model competence
- Empirical evidence for the universality hypothesis cited as supporting the possibility of convergent consciousness-like solutions
- Key empirical finding establishing that representational alignment correlates with model competence
- OpenAI GPT-4V finding supporting cross-modal training benefit
- Training on image data should improve LLM performance, and training on language data should improve vision model performancehypothesis0.775Implication of PRH for cross-modal training efficiency