method
active
method:greedy-decoded-self-report
Greedy-decoded self-report
Baseline self-report method that selects the highest-probability token; shown to collapse to a few uninformative values
Neighborhood — ranked by edge-count
Methods (1)
method
- Logit-based self-report · contradicts · Primary self-report measure: probability-weighted expected value over all ten digit-token logits, yielding a continuous rating that preserves the full distributional signal
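The contrast between the greedy baseline and the logit-based measure can be sketched as follows. This is a minimal illustration, not the source's implementation: `digit_logits` is a hypothetical list of the model's ten digit-token logits, and the mapping of those tokens to ratings 0 through 9 is an assumption (the page does not state it).

```python
import math

# Assumption: the ten digit tokens map to ratings 0..9 in order.
DEFAULT_RATINGS = list(range(10))


def softmax(logits):
    """Convert logits to probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_rating(digit_logits, ratings=DEFAULT_RATINGS):
    """Greedy self-report: the rating whose token has the highest logit."""
    best = max(range(len(digit_logits)), key=lambda i: digit_logits[i])
    return ratings[best]


def expected_rating(digit_logits, ratings=DEFAULT_RATINGS):
    """Logit-based self-report: probability-weighted mean of the ratings."""
    probs = softmax(digit_logits)
    return sum(p * r for p, r in zip(probs, ratings))


# Two hypothetical logit vectors with the same top token but different spread.
peaked = [0, 0, 0, 5, 0, 0, 0, 0, 0, 0]
spread = [0, 0, 0, 5, 4.5, 4, 0, 0, 0, 0]

print(greedy_rating(peaked), greedy_rating(spread))      # identical discrete reports
print(expected_rating(peaked), expected_rating(spread))  # distinct continuous reports
```

Both vectors give the same greedy report because only the top token matters, while the expected value shifts with the probability mass on neighboring digits.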
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Temperature=0.8 sampled decoding for self-report; reduces collapse moderately but remains discrete and noisy
- Greedy-decoded self-reports in LLaMA-3.2-3B collapse to 1.1–3.9 distinct values on a 10-point scale · finding · 0.793 · Demonstrates that default decoding masks introspective capacity; entropy 0.03–1.10 bits
- The model's verbal description of its internal state, which may be accurate or confabulated.
- A heuristic exploration strategy that selects a random action with probability epsilon, otherwise acts greedily.
- Central methodological contribution: computing probability-weighted expected value over digit-token logits recovers continuous, informative signal
- Concise framing of action-perception cycle whereby agents minimize surprise through perception and action.
- Process of reifying one's identity as an independent self; meditation practices aim to decrease selfing.
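The collapse figures quoted for greedy-decoded self-reports (number of distinct values, entropy in bits) can be illustrated with a short sketch. It assumes the entropy is Shannon entropy over the empirical distribution of reported ratings; the page does not say how the figures were computed, and the function names and sample data here are hypothetical.

```python
import math
from collections import Counter


def distinct_values(reports):
    """Number of different ratings that appear in a batch of self-reports."""
    return len(set(reports))


def entropy_bits(reports):
    """Shannon entropy (base 2) of the empirical rating distribution."""
    if not reports:
        raise ValueError("reports must be non-empty")
    n = len(reports)
    counts = Counter(reports)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Hypothetical batches: one fully collapsed, one using four ratings evenly.
collapsed = [7] * 10
varied = [1, 2, 3, 4]

print(distinct_values(collapsed), entropy_bits(collapsed))
print(distinct_values(varied), entropy_bits(varied))
```

A batch that always reports the same rating has one distinct value and zero entropy, which is the degenerate case the greedy baseline approaches.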