hypothesis
active
hypothesis:we-hypothesize-that-a-representation-network-rn-emerges-from-llm-representations-where-each-dimension-is-a-node-and-latent-connections-exist-between-nodes-or-clusters-of-nodesWe hypothesize that a Representation Network (RN) emerges from LLM representations, where each dimension is a node and latent connections exist between nodes or clusters of nodes.
Core methodological hypothesis enabling the application of IIT to LLM representation sequences.
Source paper
extracted_from(2025) · Li, Jingkai
Neighborhood — ranked by edge-count
Concepts (1)
concept
- The primary paper being extracted — applies IIT 3.0 and 4.0 to LLM representation sequences derived from ToM test data to investigate whether consciousness phenomena can be observed.
Frameworks (1)
framework
- Representation Network (RN)groundsNovel construct introduced by this paper: a hypothetical graph embedded in the time series of LLM representations, where each dimension is a node and latent connections are edges.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The paper's concluding summary statement asserting the deep interpretive significance of representation geometry.
- The paper's deepest interpretive claim, asserting that representation structure and behavioral structure are not coincidentally aligned but deeply connected.
- Motivates the RN hypothesis by pointing to the unknown relational structure within high-dimensional representation vectors.
- Importance of hierarchical structure for flexible coordination.
- Key theoretical position distinguishing analysis of representations from analysis of LLM architecture.
- The paper's central thesis statement, presented prominently after the abstract
- Qualified positive claim from spatio permutation analysis where two cases satisfy all three criteria.
- Linear representation hypothesis: neural networks represent meaningful concepts as directions in their activation spaces.hypothesis0.786Foundation for interpreting features as linear directions.