claim
active
claim:llms-sometimes-know-statements-are-false-but-generate-them-anyway-motivating-the-need-for-techniques-that-inspect-internal-model-state-rather-than-outputs-alone

LLMs sometimes know statements are false but generate them anyway, motivating the need for techniques that inspect internal model state rather than outputs alone

Motivating claim supported by the CAPTCHA example and Perez et al. (2022) findings

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.