concept
active
concept:deepseek-r1-incentivizing-reasoning-capability-in-llms-via-reinforcement-learning-deepseekai-2025

DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning (DeepSeekAI, 2025)

Paper introducing DeepSeek-R1 model and reporting self-reflection as aha moment

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.