concept
active
concept:can-consciousness-be-observed-from-large-language-model-llm-internal-states-dissecting-llm-representations-obtained-from-theory-of-mind-test-with-integrated-information-theory-and-span-representation-analysis

Can 'Consciousness' Be Observed from Large Language Model (LLM) Internal States? Dissecting LLM Representations Obtained from Theory of Mind Test with Integrated Information Theory and Span Representation Analysis

The primary paper being extracted — applies IIT 3.0 and 4.0 to LLM representation sequences derived from ToM test data to investigate whether consciousness phenomena can be observed.

Neighborhood — ranked by edge-count

Thinkers (28)

thinker
  • Developer of integrated information theory; provides formal tools for measuring integration and consciousness in systems.
  • Emergent abilities of LLMs.
  • Philosopher referenced for 'what it's like' framework applied to understanding memory reconstruction from past self perspective.
  • Originator of Global Workspace Theory.
  • Lead author of 'Attention is all you need', introducing the transformer architecture
  • Co-author with Hoel and Tononi on quantitative causal emergence.
  • AI researcher, co-author of articles on artificial general intelligence and evolving AI.
  • Co-author of the classic Theory of Mind paper.
  • First author of Oizumi, Albantakis, & Tononi (2014) 'From the phenomenology to the mechanisms of consciousness: integrated information theory 3.0'.
  • Applied IIT to ECoG data showing elevated Φ in conscious visual perception; key empirical precedent for IIT.
  • Author whose work on comparing LLM representations with human brain activity during NLP informed this study's layer sampling strategy.
  • Author whose methodology for span representations from BERT is adopted in this study.
  • Co-author of a recent paper applying IIT to distinguish intelligence from consciousness in LLMs; argues feedforward LLMs yield low Φ.
  • Author of a key precedent study implementing IIT on resting-state fMRI data whose methodology this study closely adapts.
  • Lead author of the ToM test dataset used in this study.
  • Jingkai Li
    authored
    Sole author of the paper; affiliated with OpenSci.World in Montréal.
  • Author whose findings suggest intermediate-to-deep LLM layers best predict human brain activity; guides layer selection in this study.
  • Author who assessed ChatGPT consciousness using IIT alongside Turing Test, concluding low integration precludes consciousness.
  • Author of the span representation method (from ELMo paper) adopted in this study for computing Span Representations.

+4 more

Frameworks (6)

framework
  • Tononi et al. framework quantifying consciousness via integration; provides mathematical tools for measuring agent complexity.
  • Novel construct introduced by this paper: a hypothetical graph embedded in the time series of LLM representations, where each dimension is a node and latent connections are edges.
  • IIT 3.0
    implements
    Version 3.0 of IIT, used to compute Φmax and Conceptual Information (CI) from LLM representation networks.
  • IIT 4.0
    implements
    Version 4.0 of IIT, used to compute Φ and Φ-structure from LLM representation networks; latest iteration at time of study.
  • Framework for characterizing span-level information of sequences of representations, independent of any consciousness estimate; used as a comparison baseline.
  • The cognitive ability to attribute mental states to oneself and others; used as the empirical domain for testing LLM representations.

Claims (9)

claim

Methods (2)

method

Hypotheses (4)

hypothesis

Concepts (4)

concept

Questions (3)

question

Datasets (1)

dataset

Venues (1)

venue

Artifacts (1)

artifact

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.