finding
active
finding:auditory-models-are-roughly-aligned-with-llms-up-to-a-linear-transformationAuditory models are roughly aligned with LLMs up to a linear transformation
Ngo & Kim result extending cross-modal convergence to the auditory domain
Source paper
extracted_from(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola
Neighborhood — ranked by edge-count
Claims (1)
claim
- Primary empirical claim of the paper
Hypotheses (1)
hypothesis
- The central hypothesis of the paper; the platonic representation hypothesis itself
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core cross-modal empirical result: larger and better language models align better with vision models
- Establishes that the observed linear structure is not merely a representation of text probability
- Interpretive claim connecting scale to abstraction level in LLM representations
- Theoretical interpretation of antipodal alignment and misalignment phenomena in PCA visualizations
- Prior work framework studying whether LLMs encode world models as linear structures in their representations
- Key cross-modal alignment result
- Primary statistical model with random intercept by conversation, REML estimation, for pooled conversation-turn observations
- We hypothesize that LLMs represent correctness of arithmetic expressions differently from factual statements.hypothesis0.760Core working hypothesis motivating the factual vs. arithmetic task split in the experimental design.