framework
active
framework:deepseek-r1

DeepSeek-R1

Open-source reasoning LLM from DeepSeekAI trained with reinforcement learning to exhibit self-reflection

Neighborhood — ranked by edge-count

Thinkers (1)

thinker
  • DeepSeekAI
    studies
    Organization that introduced DeepSeek-R1 and reported the aha moment of self-reflection

Frameworks (2)

framework
  • ReflCtrl
    studies
    The proposed framework for probing and steering self-reflection behavior in reasoning LLMs via representation engineering
  • Cost-efficient training algorithm used by DeepSeek-R1 for RL-based reasoning

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.