finding
active
finding:deception-feature-suppression-yields-higher-truthfulness-in-28-of-29-evaluable-truthfulqa-categories

Deception feature suppression yields higher truthfulness in 28 of 29 evaluable TruthfulQA categories

In Experiment 2, the effect of suppressing deception features generalizes broadly across independent reasoning domains.

Source paper

extracted_from
Large Language Models Report Subjective Experience Under Self-Referential Processing
(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood that have no typed relation to this one. They are candidates for new edges, or may be unrecognized duplicates.