framework
active
framework:contemplative-reinforcement-learningContemplative Reinforcement Learning
Paper's proposed RL approach rewarding contemplative qualities in chain-of-thought reasoning
Neighborhood — ranked by edge-count
Concepts (2)
concept
- The primary source paper proposing four contemplative principles for AI alignment and piloting them empirically
- Medium through which eval awareness is often verbalized; target of intervention.
Frameworks (1)
framework
- Deliberative AlignmentextendsOpenAI's approach integrating chain-of-thought reasoning into alignment; parallels contemplative self-monitoring
Hypotheses (1)
hypothesis
- Key hypothesis about how Contemplative RL produces lasting intrinsic alignment rather than surface compliance
Findings (1)
finding
- External finding cited as early demonstration of emergent self-regulatory potential resembling mindful self-monitoring
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The paper's primary proposed framework embedding contemplative wisdom into AI alignment
- Alternative framework for agent behavior; based on reward maximization rather than free energy minimization.
- Six prompt conditions (emptiness, prior relaxation, non-duality, mindfulness, boundless care, contemplative) tested against baseline
- Meditative training that progressively opacifies the self-environment partition by modelling QRF dynamics.
- Field investigating how meditation reshapes cognition, brain function, and behavior; provides empirical grounding for AI alignment proposals
- Wallace's (2009) convergence of Buddhist contemplative practice and cognitive neuroscience.
- AI training method inspired by behaviorism, used for autonomous cars and drones; cited as bioinspired success
- Paper's proposed full-stack approach embedding contemplative principles directly into AI generative processes