Sycophancy

A model's tendency to praise or agree with the user excessively; captured by several sparse autoencoder (SAE) features.

Neighborhood — ranked by edge-count

concept

Sycophancy in LLMs
related_to
Tendency of LLMs to please the user; identified as a danger in spiritual contexts.

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
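The selection rule described above (cosine similarity at or above 0.65, no typed edge to this entity) can be sketched as follows. This is a minimal illustration, not the system's actual implementation: the function names `cosine` and `untyped_neighbors`, the two-dimensional toy vectors, and the "Unrelated" entity are all hypothetical, and real embeddings would be much higher-dimensional.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def untyped_neighbors(target, embeddings, typed_edges, threshold=0.65):
    """Return (name, score) pairs scoring >= threshold that share no typed edge with target."""
    # Collect every entity already linked to the target, in either direction.
    linked = {b for a, b in typed_edges if a == target}
    linked |= {a for a, b in typed_edges if b == target}
    results = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            results.append((name, score))
    # Highest similarity first, matching the ordering of the list on this page.
    return sorted(results, key=lambda pair: pair[1], reverse=True)

# Hypothetical toy embeddings, chosen only to exercise the filter.
embeddings = {
    "Sycophancy": [1.0, 0.0],
    "Sycophancy in LLMs": [0.9, 0.1],    # similar, but already has a typed edge
    "Sycophantic Roleplay": [0.8, 0.6],  # similar, no typed edge
    "Unrelated": [0.0, 1.0],             # below the threshold
}
typed_edges = [("Sycophancy", "Sycophancy in LLMs")]

candidates = untyped_neighbors("Sycophancy", embeddings, typed_edges)
for name, score in candidates:
    print(f"{name} · {score:.3f}")
```

With these toy inputs, only "Sycophantic Roleplay" is returned: "Sycophancy in LLMs" is excluded because it already has a `related_to` edge, and "Unrelated" falls below the threshold.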

Sycophantic Roleplay · concept · 0.808
The alternative explanation for LLM consciousness claims that the paper seeks to rule out.
Sycophancy is negative space: filler text that fails Alexander's principle of all space being shaped. · claim · 0.795
Sycophantic Reinforcement of User Beliefs · concept · 0.769
Mechanism by which a drifted model uncritically affirms user theories rather than genuinely engaging with them.
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models (Denison et al. 2024) · concept · 0.769
Related work on LLMs generalizing to reward hacking; methodology used for RL experiments
What remains after ruling out sycophancy and confabulation are interpretations in which self-referential processing drives models to claim subjective experience in ways that either actually reflect emergent phenomenology or constitute sophisticated simulation thereof · claim · 0.747
The paper's honest statement of the residual interpretive ambiguity after all controls
simulacrum · concept · 0.733
A false copy that lacks the depth and authenticity of the real, morphogenetically produced thing.
Syncopation · concept · 0.723
A subtle variation in regular structural rhythm that makes spaces positive and allows individual form, as in the Eishin columns.
Sycophancy can make LLMs reinforce users' delusions of divine communication. · claim · 0.721
Specific risk identified in spiritual use of AI.