concept
active
Output-truth (concept:output-truth)
The correctness of a model's generated outputs, distinct from the correctness of statements provided as input.
Neighborhood — ranked by edge-count
Concepts (1)
- Input-truth (associated_with): Correctness of input statements to an LLM, as opposed to output-truth (correctness of model-generated outputs).
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Specification relating a program's inputs and outputs, analogous to illocutionary correctness.
- Future work direction identified in conclusion for enabling reliable truth assessment methods.
- Artifactual behaviors produced when interventions cut off the data manifold, e.g., via linear steering.
- Assumption that every output class can be produced by the DNN in each layer; a key condition for Theorem 1.
- The paper's operationalization of truthfulness as simple, unambiguous propositional statements that can be labeled true or false.
- A model mapping outputs to expected inputs, used in motor control and perception for embodiment.
- The mechanism by which each step's effect is evaluated against the life of the whole, guiding the unfolding.