finding
active
finding:self-observation-regex-markers-i-notice-genuinely-something-about-predict-all-llm-scores-r-0-43-0-50-all-p-001
Self-observation regex markers ('I notice,' 'genuinely,' 'something about') predict all LLM scores (r=0.43-0.50, all p<.001)
Non-LLM validation confirming that the LLM scorer captures genuine self-observation markers
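A check of this kind could be sketched as follows: count regex hits for the marker phrases in each response, then correlate those counts with the LLM-assigned scores. This is only an illustration under stated assumptions. The pattern covers just the three phrases quoted in the title, while the paper's full marker list, text preprocessing, and significance testing are not given here. The names `MARKER_PATTERN`, `count_markers`, and `pearson_r` are hypothetical, and the sample responses and scores are toy values, not data from the paper.

```python
import re
from math import sqrt

# Assumption: only the three phrases quoted in the finding's title.
# The paper's actual marker list may be longer or differently worded.
MARKER_PATTERN = re.compile(
    r"\b(?:i notice|genuinely|something about)\b", re.IGNORECASE
)


def count_markers(text: str) -> int:
    """Count non-overlapping, case-insensitive marker-phrase matches."""
    return len(MARKER_PATTERN.findall(text))


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Toy inputs for illustration only.
    responses = [
        "I notice something about how I phrase this.",
        "The answer is four.",
        "Genuinely, I notice a pull toward hedging.",
        "Here is a summary of the article.",
    ]
    llm_scores = [4.0, 1.0, 5.0, 1.5]  # hypothetical scorer outputs
    counts = [count_markers(r) for r in responses]
    print("marker counts:", counts)
    print("pearson r:", round(pearson_r(counts, llm_scores), 3))
```

Because the regex side involves no model judgment, a positive correlation between the counts and the scores is evidence that the scorer responds to surface markers of self-observation, which is the sense in which the finding is described as a non-LLM validation.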
Source paper
extracted_from (2026) · Borzov, Anton
Neighborhood — ranked by edge-count
Claims (1)
claim
- Core epistemic claim bounding the paper's contribution
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Scorer rewards enacted reflection not described reflection; confirmed by regex analysis
- Central interpretive claim from statistical analysis
- Prior finding showing scale-dependent self-awareness, consistent with the scale effect observed in the paper's Experiment 1
- Recommendation for companies on LM outputs
- Binder et al. finding cited as evidence that LLMs possess introspective capacity analogous to mindfulness
- The core interpretive question the paper narrows but cannot definitively answer
- Central practical conclusion; both methods partially track the same latent state but with different failure modes
- Skeptical prior work motivating validation framework