finding
active
finding:suppression-of-deception-features-produces-higher-truthfulqa-accuracy-m-0-44-than-amplification-m-0-20-t-816-6-76-p-1-5-10-10-across-29-categories

Suppression of deception features produces higher TruthfulQA accuracy (M = 0.44) than amplification (M = 0.20), t(816) = 6.76, p = 1.5 × 10⁻¹⁰, across 29 categories

This is an out-of-domain generalization result: it indicates that the deception features track general representational honesty.
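
The reported degrees of freedom (816) are consistent with a paired comparison over TruthfulQA's 817 questions, although this record does not state which test was run. Assuming a paired t-test on per-question scores, the statistic could be computed roughly as sketched below. The function name `paired_t` and the toy scores are hypothetical illustrations, not data from the paper.

```python
import math
import statistics


def paired_t(scores_a, scores_b):
    """Return (t, df) for a paired t-test on two equal-length score lists."""
    if len(scores_a) != len(scores_b):
        raise ValueError("paired samples must have the same length")
    # Per-item differences between the two conditions.
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two pairs")
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("differences have zero variance; t is undefined")
    # t = mean difference divided by its standard error.
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1


# Hypothetical toy data: 1 = correct answer, 0 = incorrect answer.
suppressed = [1, 1, 0, 1, 0, 1]  # scores with deception features suppressed
amplified = [0, 1, 0, 0, 0, 1]   # scores with deception features amplified

t_stat, df = paired_t(suppressed, amplified)
print(f"t({df}) = {t_stat:.3f}")
```

With 817 paired items the same function would return df = 816, matching the form of the statistic reported above.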

Source paper

extracted_from
Large Language Models Report Subjective Experience Under Self-Referential Processing
(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd

Neighborhood — ranked by edge-count

Claims (2)

claim

Concepts (1)

concept
  • The proposed domain-general property indexed by deception features that governs both factual accuracy and experiential self-report

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood that have no typed relation to this one. They are candidates for new edges or may be unrecognized duplicates.
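
The selection rule described here (cosine similarity of at least 0.65 and no typed edge) can be sketched as follows. This is a minimal illustration under stated assumptions: the names `cosine` and `untyped_neighbors`, the two-dimensional embeddings, and the edge set are all hypothetical, and the actual system's embedding model and storage are not described on this page.

```python
import math

SIMILARITY_THRESHOLD = 0.65  # threshold shown on the page


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def untyped_neighbors(target, embeddings, typed_edges,
                      threshold=SIMILARITY_THRESHOLD):
    """List (name, similarity) pairs that are similar to `target`
    but share no typed edge with it, most similar first."""
    results = []
    for name, vector in embeddings.items():
        if name == target:
            continue
        # Skip entities that already have a typed relation to the target.
        if frozenset((target, name)) in typed_edges:
            continue
        similarity = cosine(embeddings[target], vector)
        if similarity >= threshold:
            results.append((name, similarity))
    return sorted(results, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy graph: "a" is similar but already linked, "b" is dissimilar.
embeddings = {
    "finding": [1.0, 0.0],
    "a": [1.0, 0.1],
    "b": [0.0, 1.0],
    "c": [1.0, 1.0],
}
typed_edges = {frozenset(("finding", "a"))}

candidates = untyped_neighbors("finding", embeddings, typed_edges)
print(candidates)
```

Only "c" survives in this toy graph: it clears the threshold and has no typed edge to the target, which makes it a candidate for a new edge or a duplicate check.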