finding
active
finding:mas-iia-for-count-vs-low-cumuval-values-1-10-is-higher-than-count-vs-full-cumuval-but-still-lower-than-count-vs-rem-ops
MAS IIA for Count vs Low CumuVal (values 1-10) is higher than Count vs full CumuVal, but still lower than Count vs Rem Ops
Qualifies the arithmetic alignment results; supports the hypothesis that Arithmetic GRUs use numeric representations different from those used for incremental counting.
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Interpretive hypothesis supported by the lower IIA between the Count and CumuVal variables, even in the restricted value range.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- Shows MAS can compare specific numeric variables across tasks with different domains/codomains.
- Validates MAS as a causal detector of representational differences invisible to correlative methods.
- Empirical result showing that the CL loss can reduce divergence without sacrificing interpretability accuracy.
- Demonstrates MAS's ability to bidirectionally transfer behavior where RSA shows low embedding correlation.
- Proof-of-principle that MAS can detect model misalignment in DeepSeek-R1-Qwen-1.5B fine-tuned models.
- Demonstrates the value of the CL auxiliary loss for recovering causal alignments when one model cannot be intervened upon.
- TrackerAgent's second-place ranking calibrates the benchmark and highlights LLM shortcomings.
- Core empirical finding about layer-dependent truth direction emergence across task types.