finding
active
finding:ali-et-al-2025-found-contrastive-activation-addition-less-effective-at-larger-model-scale-consistent-with-esr-in-70b

Ali et al. (2025) found contrastive activation addition to be less effective at larger model scale, consistent with ESR in the 70B model

A prior finding from related work that is consistent with ESR being strongest in the largest model tested.

Source paper

extracted_from
Endogenous Resistance to Activation Steering in Language Models
(2026) · Alex McKenzie · Keenan Pepper · Stijn Servaes · Martin Leitgab +5
