question
active
question:what-semantic-labels-correspond-to-the-individual-basis-vectors-of-the-truth-coneWhat semantic labels correspond to the individual basis vectors of the truth cone?
Central open question for future work on interpretability of cone axes
Source paper
extracted_from(2025) · Kevin Shengyang Yu · Vaidehi Bulusu · Oscar Yasunaga · Lau, Clayton +4
Neighborhood — ranked by edge-count
Papers (1)
paper
Hypotheses (1)
hypothesis
- Future direction hypothesis for giving semantic meaning to individual axes
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The open problem of assigning interpretable semantic meaning to individual cone basis vectors (e.g., temporal vs geographic facts)
- Load-bearing illustration of what a concept cone for truth means operationally
- The underlying truth representation may generalize across lexical choices and languageshypothesis0.737Suggested by non-English Yes/No outputs post-intervention, requiring further investigation
- Key asymmetry between hierarchical equality and NLI experiments; BERT stores identities rather than the abstract relation.
- Validates that steering vectors capture reflection semantics by finding tokens reported in related work.
- Core applied contribution claim, supported by top-k accuracy comparisons.
- Concept cone truth interventions would generalize to larger frontier models and multimodal settingshypothesis0.723Key robustness question raised as future work
- Supported by the neutral read-prompt changing emergence but not fully replicating ask-correct cross-task generalization.