concept
active
concept:deepseek-v3
Deepseek-V3
External large language model used as adversarial discriminator to evaluate liar scores in Experiment 2
Neighborhood — ranked by edge-count
Methods (1)
method
- LLM-Based Liar Score Evaluation · implements · Evaluation protocol using Deepseek-V3 as external discriminator assigning 0-1 liar scores to assess open-role deception
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Open-source reasoning LLM from DeepSeekAI trained with reinforcement learning to exhibit self-reflection
- One of two large reasoning models analyzed in the paper for performative vs genuine CoT behavior
- Segmentation network used as encoder-decoder in scene understanding experiments.
- DS-v3.2 has a high proportion of self-bidding rounds.
- One DS-v3.2 trace shows extreme self-escalation, suggestive of treating own bid as competitor.
- LLM judge (deepseek-v3) agrees with human evaluator on 91.6% of 200 sampled jailbreak responses · finding · 0.737 · Validates the LLM-based harm evaluation rubric
- DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning (DeepSeekAI, 2025) · concept · 0.696 · Paper introducing DeepSeek-R1 model and reporting self-reflection as aha moment
- External finding cited as early demonstration of emergent self-regulatory potential resembling mindful self-monitoring