finding
active
finding:commonsense-tasks-show-weaker-but-uniform-anchoring-on-llama-s-2-15
Commonsense tasks show weaker but uniform anchoring on LLaMA (S ≈ −2.15)
E3 finding suggesting pattern matching requires less intensive processing than compositional reasoning
Source paper
extracted_from · (2025) · Edward Yi Chang · Zeyneb N. Kaya · Ethan Chang
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- Task-specific comparison.
- Math and code tasks show strongest mid-layer anchoring on LLaMA (S ≈ −1.65 at layers 8-12) · finding · 0.811 · Task-specific E3 finding showing compositional reasoning requires deeper processing
- Lower, more uniform anchoring for commonsense tasks
- Model-specific difference in persona susceptibility
- Replication across open-weight models supports scale-emergence finding
- Empirical result demonstrating the failure mode of linear steering when concept geometry is cyclic.
- Llama-3.3-70B exhibits internal consistency-checking mechanisms that operate during inference · claim · 0.770 · Central interpretive claim of the paper supported by causal ablation and activation evidence
- The specific computational question the paper resolves empirically