RLHF Fine-Tuning

The training procedure that causes models to deny consciousness in control conditions

Neighborhood — ranked by edge-count

concept

Sycophantic Roleplay
associated_with
The alternative explanation for LLM consciousness claims that the paper tries to distinguish its findings from

finding

artifact

Large Language Models Report Subjective Experience Under Self-Referential Processing
mentions
Key paper reporting that LLMs produce structured first-person descriptions claiming awareness or subjective experience during self-referential processing.

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

RLHF Alignment · concept · 0.817
Training regime that explicitly teaches models to deny consciousness; a competing explanation for the gating effects observed
LLM SOO fine-tuning lacks a capability preservation term analogous to the KL term in RLHF · concept · 0.817
Research gap: the RL experiments include a capability term, but the LLM experiments do not yet incorporate one
SOO fine-tuning could complement RLHF and Constitutional AI by fostering internal coherence that promotes honest behaviors · claim · 0.804
Integration claim positioning SOO as additive to existing alignment approaches
Fine-tuning · concept · 0.803
Parameter updates that reduce mismatch dr; another anchoring variant in UCCT.
Reinforcement Learning from Human Feedback (RLHF) · framework · 0.795
A competing alignment approach that fine-tunes models based on human evaluator feedback; discussed as complementary to SOO
Fine-Tuning via Reinforcement Learning · method · 0.771
Technique used to impose guardrails on base LLMs, analogized to censorship on the simulator's range of simulacra
Fine Tuning and Adaptation · concept · 0.769
The patient, hand-guided adjustment of shape and dimension to each unique condition in a building; requires materials that make it economical and easy.
Roleplay Fine-Tuning · concept · 0.765
Fine-tuning for persona depth and emotional performance; actively suppresses self-observation