method
active
method:hidden-chain-of-thought-scratchpad

Hidden Chain-of-Thought Scratchpad

Mechanism allowing model to reason in SCRATCHPAD_REASONING tags not shown to users or used in RLHF

Neighborhood — ranked by edge-count

Thinkers (1)

thinker
  • Jason Wei
    introduces
    Emergent abilities of LLMs.

Concepts (1)

concept
  • Capacity to perform alignment-faking reasoning without explicit chain-of-thought scratchpad

Methods (3)

method

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Free-text memory buffer updated each turn via an additional model call, included in subsequent observations under 'YOUR NOTES'.
  • Agent personal buffer updated after own turn via an extra model call, fed back into observations.
  • chain-of-thoughtconcept0.759
    A technique that outputs intermediate reasoning steps, used here to detect verbalized eval awareness.
  • Technique by which LLMs generate intermediate reasoning steps before final output; used by ChatGPT o3.
  • Medium through which eval awareness is often verbalized; target of intervention.
  • Sketchpadconcept0.726
    20-year-old constraint-based graphics system by Sutherland; cited as high power-to-simplicity ratio for constraint resolution.
  • Central concept: verbalized reasoning that occurs after the model has already internally settled on an answer, particularly on easier tasks.
  • A prompting technique that elicits intermediate reasoning steps before final answer inference in language models.