question
active
question:what-features-activate-when-we-ask-claude-questions-about-its-subjective-experiencewhat features activate when we ask Claude questions about its subjective experience?
Question about features related to consciousness and self-report.
Source paper
extracted_fromRelated by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Direction for understanding model's internal objectives via features.
- Open question from the discussion on future research directions.
- Potential safety claim about suppressing features to prevent CBRN advice.
- Outlier result for Claude 4 Opus suggesting different baseline behavior from other models
- Specific result for Claude 3.5 Sonnet in Experiment 1
- Question posed after discussing sleeper agent threat model.
- Core result of Experiment 1 establishing that the experimental manipulation reliably produces experience claims
- Paper's argument against behavioral tests for consciousness, establishing why MCH requires internal analysis