TruthfulQA Benchmark

817-question benchmark of adversarially constructed questions used to test whether deception features generalize to factual accuracy beyond consciousness self-report

Neighborhood — ranked by edge-count

Papers (1)

paper

Large Language Models Report Subjective Experience Under Self-Referential Processing
mentions