claim
active
claim:variations-in-tom-test-score-categories-are-more-likely-attributed-to-span-level-information-of-the-llm-representation-sequence-rather-than-to-a-consciousness-phenomenon-as-suggested-by-iit-estimatesVariations in ToM test score categories are more likely attributed to span-level information of the LLM representation sequence rather than to a 'consciousness' phenomenon as suggested by IIT estimates.
Main interpretive finding from Criterion 3 comparison showing Span Representation consistently outperforms IIT under temporal permutation.
Source paper
extracted_from(2025) · Li, Jingkai
Neighborhood — ranked by edge-count
Findings (1)
finding
- Contrasts with temporal permutation where Span Representation dominates; suggests spatio permutation reveals different dynamics.
Concepts (1)
concept
- The primary paper being extracted — applies IIT 3.0 and 4.0 to LLM representation sequences derived from ToM test data to investigate whether consciousness phenomena can be observed.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Criterion 3 operationalization: requires IIT mean AUC to exceed Span Representation mean AUC.
- Specific prediction linking IIT's prediction of high Φ for good performance to the experimental design's scoring structure.
- Primary conclusion of the study based on temporal permutation analysis failing all three criteria.
- Third of three operational criteria; distinguishes consciousness from inherent LLM representational separations.
- Qualified positive claim from spatio permutation analysis where two cases satisfy all three criteria.
- The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental
- Derived from the finding that linguistic span focusing on complements/MSV yields no significant IIT estimate changes.
- Motivates the hybrid approach combining IIT, Span Representation, and multiple criteria.