finding
active
finding:some-activation-capping-settings-slightly-improve-performance-on-ifeval-mmlu-pro-or-gsm8k-for-both-qwen-and-llama

Some activation capping settings slightly improve performance on IFEval, MMLU-Pro, or GSM8K for both Qwen and Llama

Unexpected positive finding suggesting that activation capping may sometimes improve capabilities

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.