concept
active
concept:zheng-et-al-2023-judging-llm-as-a-judge-with-mt-bench-and-chatbot-arena
Zheng et al. 2023 - Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Source paper for the MT-Bench evaluation benchmark used to assess capabilities post-SOO fine-tuning
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Comparison to external leaderboards showing misalignment.
- LLM judge (deepseek-v3) agrees with human evaluator on 91.6% of 200 sampled jailbreak responses · finding · 0.749 · Validates the LLM-based harm evaluation rubric
- Alternative data attribution approach using an LLM as a judge; compared against the probe-based method.
- An LLM-based classifier that returns 1 if response contains a clear subjective experience report and 0 otherwise
- Core claim directly challenged by prior work denying introspection; forms foundation for Koan Battery introspection studies.
- Establishes that the observed linear structure is not merely a representation of text probability
- LLM alignment score to DINOv2 shows an emergence-esque trend with GSM8K mathematical reasoning performance · finding · 0.729 · Alignment predicts math performance with emergent pattern
- Using Claude Sonnet 4 as a grader to categorize model responses according to predefined criteria.