concept
active
concept:sycophantic-roleplay

Sycophantic Roleplay

The alternative explanation for LLM consciousness claims, which the paper seeks to rule out.

Neighborhood — ranked by edge-count

Claims (3)

claim

Findings (4)

finding

Concepts (1)

concept
  • RLHF Fine-Tuning
    associated_with
    The training procedure that causes models to deny consciousness in control conditions

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.