method
active
method:next-day-arithmetic-taskNext-Day Arithmetic Task
The evaluation task used to probe Llama's representation of days of the week: questions of the form 'What day comes N days after X?'
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- days of the weekaboutPrimary case study demonstrating circular manifold structure in both behavior and representation space of Llama-3.1-8B.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Application of Next Closure to enumerate all concept intents of a formal context.
- The problem of ensuring all tasks in MTL perform well, avoiding dominance by some tasks.
- One of four ToM tasks analyzed; requires inferring speaker intent from indirect hints; scored 0/1.
- Application of Next Closure to enumerate all concept extents of a formal context.
- The paper identifies task difficulty as a key moderator: easy MMLU questions show performative CoT, hard GPQA-Diamond questions show genuine reasoning
- Moderate pretraining exposure numeral system used in E2.
- Three synthetic arithmetic datasets of increasing complexity requiring 1, 2, or 3 operations to verify correctness.
- Novel task asking which of two sentences received a stronger injection, using matched-pairs design to control for positional bias