finding
active
finding:llama-3-1-8b-output-token-distributions-for-seven-days-of-the-week-form-seven-clusters-in-a-rough-circle-in-behavior-space-hellinger-distance-geometry
Llama-3.1 8B output token distributions for seven days of the week form seven clusters in a rough circle in behavior space (Hellinger distance geometry).
Empirical observation establishing that Llama's behavior for days-of-week tasks has circular structure.
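As a rough illustration of what "Hellinger distance geometry" means here, the sketch below computes pairwise Hellinger distances between next-token distributions. It is not the paper's pipeline: the function names and the synthetic seven-way distributions are assumptions made for illustration, and no real Llama-3.1 8B outputs are used.

```python
import numpy as np


def hellinger(p, q):
    """Hellinger distance between two discrete distributions; lies in [0, 1]."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.sqrt(0.5 * np.sum((np.sqrt(p) - np.sqrt(q)) ** 2)))


def pairwise_hellinger(dists):
    """Return the (n, n) matrix of Hellinger distances between the rows of `dists`."""
    roots = np.sqrt(np.asarray(dists, dtype=float))
    diff = roots[:, None, :] - roots[None, :, :]
    return np.sqrt(0.5 * np.sum(diff ** 2, axis=-1))


def toy_day_distributions(n_days=7, sharpness=2.0):
    """Hypothetical stand-in data: row i is a distribution over n_days tokens,
    peaked on day i and decaying with cyclic distance from it."""
    idx = np.arange(n_days)
    offset = np.abs(idx[:, None] - idx[None, :])
    cyclic = np.minimum(offset, n_days - offset)  # wrap-around distance
    probs = np.exp(-sharpness * cyclic)
    return probs / probs.sum(axis=1, keepdims=True)


# Distance matrix for the synthetic "days"; rows and columns are ordered Mon..Sun.
D = pairwise_hellinger(toy_day_distributions())
```

In this toy setup, adjacent days are closer to each other than days three steps apart, and the distance wraps around (the first and last day are neighbors), which is the kind of pattern a circular arrangement would produce. Whether real model outputs show it is an empirical question that this sketch does not answer.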
Source paper
extracted_from
Neighborhood, ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- The paper's causal explanation for why representation and behavior geometry both appear circular for days of the week.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Empirical observation establishing that Llama's internal representations for days-of-week have circular geometric structure.
- Core empirical result demonstrating that manifold steering produces on-target, behavior-aligned outputs.
- Empirical result demonstrating the failure mode of linear steering when concept geometry is cyclic.
- Demonstrates that small models represent surface features rather than abstract truth.
- The specific Fourier feature periods identified confirm base-10 rather than modular computation.
- Larger models linearly represent more general concepts, including truth.
- Third promising case from temporal permutation analysis.
- The complete mechanistic algorithm discovered for cyclic concept reasoning