method
active
method:next-day-arithmetic-task

Next-Day Arithmetic Task

The evaluation task used to probe Llama's representation of days of the week: questions of the form 'What day comes N days after X?'

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Primary case study demonstrating circular manifold structure in both behavior and representation space of Llama-3.1-8B.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Application of Next Closure to enumerate all concept intents of a formal context.
  • Task balancingconcept0.724
    The problem of ensuring all tasks in MTL perform well, avoiding dominance by some tasks.
  • Hinting Taskmethod0.721
    One of four ToM tasks analyzed; requires inferring speaker intent from indirect hints; scored 0/1.
  • Application of Next Closure to enumerate all concept extents of a formal context.
  • Task Difficultyconcept0.716
    The paper identifies task difficulty as a key moderator: easy MMLU questions show performative CoT, hard GPQA-Diamond questions show genuine reasoning
  • Base-8 arithmeticconcept0.714
    Moderate pretraining exposure numeral system used in E2.
  • Three synthetic arithmetic datasets of increasing complexity requiring 1, 2, or 3 operations to verify correctness.
  • Novel task asking which of two sentences received a stronger injection, using matched-pairs design to control for positional bias