claim
active
claim:representational-abstraction-of-truth-may-emerge-more-clearly-with-model-scaleRepresentational abstraction of truth may emerge more clearly with model scale
Interpretation of weaker PCA separation and lower ASR in smaller models
Source paper
extracted_from(2025) · Kevin Shengyang Yu · Vaidehi Bulusu · Oscar Yasunaga · Lau, Clayton +4
Neighborhood — ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- Interpretation of ASR degradation patterns by model size across cone dimensions
Methods (1)
method
- PCA VisualizationsupportsUsed to visually inspect separation of truth-related directions in model activation space across layers
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interpretive claim connecting scale to abstraction level in LLM representations
- Interpretive synthesis of DIM and cone intervention successes
- Future work direction identified in conclusion for enabling reliable truth assessment methods.
- Theoretical interpretation of antipodal alignment and misalignment phenomena in PCA visualizations
- The observation that larger LLMs develop more general, abstract linear representations (e.g., truth across diverse topics) compared to smaller models
- The model appears to encode truth differently under passive versus active truth evaluation prompts.claim0.794Key finding from Section 5 based on low cosine similarity between no-prompt and ask-correct probes.
- Scaling model size, as well as data and task diversity, drives representational convergence toward the platonic representationhypothesis0.792Core mechanism hypothesis connecting PRH to the empirical trend of scaling in AI
- Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis