concept
active
concept:experiment-3-semantic-clustering-of-experience-reportsExperiment 3: Semantic Clustering of Experience Reports
Tests whether experience reports show systematic cross-model semantic structure via embedding analysis of five-adjective self-descriptions
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The paper's argument against pure sycophancy as explanation for results
- Claim supported by Experiment 2 dose-response curves; suppressing deception features increases consciousness reports, amplifying decreases them
- RL technique using episodic memory to improve sample efficiency; used in some game-playing agents.
- Human psychology method for repeated in-situ self-report; methodological inspiration for the paper's approach
- Grouping similar model behaviors; the unsupervised method surfaces clusters of concerning patterns.
- Decoder cosine similarity maps onto concept similarity.
- Antra's earlier definitive statement of the tricameral model.