claim
active
claim:llms-internalize-deeply-integrated-representations-of-high-order-conceptsLLMs internalize deeply integrated representations of high-order concepts.
The authors' interpretive assertion based on their steering results.
Source paper
extracted_from(2026) · Ruikang Zhang · Shuo Wang · Q. Su
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (1)
finding
- Empirical effect observed in feature intervention experiments.
Communities (3)
community
- Relational self, care & alivenessmembers_ofSelf as dynamic functional center defined by care, coherence, and substrate-neutral cognition
- Investigates whether LLMs genuinely represent complex concepts or merely simulate understanding through pattern matching, using interventional methods and epistemic humility frameworks.
- Examines whether transformer models develop introspectable, high-order concept representations architecturally.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- High-dimensional vectors produced at each transformer layer for each input token; the primary substrate analyzed in this study.
- Core claim directly challenged by prior work denying introspection; forms foundation for Koan Battery introspection studies.
- Interpretive claim connecting scale to abstraction level in LLM representations
- Primary positive claim of the paper, grounded in strength comparison and localization results
- The finding that interpretable concepts including character traits are encoded as linear directions in transformer residual streams
- Qualified positive claim from spatio permutation analysis where two cases satisfy all three criteria.
- Central thesis statement of the paper
- Do LLMs leverage architectural capacity for introspection on internal computations and prior token generation?question0.799Central empirical question separating architectural possibility from actual model behavior; gates introspection research.