finding
active
finding:clamping-addition-feature-active-on-non-addition-code-tricks-the-model-into-believing-it-has-been-asked-to-execute-addition

Clamping the addition feature to be active on non-addition code tricks the model into believing it has been asked to execute addition.

The effect is causal: it shows that the feature governs the model's computation rather than merely correlating with it.
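To make the idea of clamping concrete, here is a minimal toy sketch in Python. It is not the procedure behind this finding: the feature names, the activation values, and the `clamp_feature` and `toy_readout` helpers are all hypothetical, and the "model" is reduced to a readout that follows whichever feature is strongest. It only illustrates the shape of the intervention, which is to pin one feature to a fixed value and compare downstream behavior with and without the pin.

```python
from typing import Dict


def clamp_feature(activations: Dict[str, float], name: str, value: float) -> Dict[str, float]:
    """Return a copy of `activations` with one feature pinned to `value`."""
    patched = dict(activations)  # copy so the baseline stays untouched
    patched[name] = value
    return patched


def toy_readout(activations: Dict[str, float]) -> str:
    """Hypothetical downstream behavior: act on the most active feature."""
    return max(activations, key=activations.get)


# Hypothetical feature activations on code that contains no addition.
baseline = {"addition": 0.0, "string_concat": 2.5, "comparison": 0.4}

# Intervention: force the (hypothetical) addition feature to a high value.
clamped = clamp_feature(baseline, "addition", 10.0)

baseline_behavior = toy_readout(baseline)
clamped_behavior = toy_readout(clamped)
```

In this toy setup the baseline readout follows `string_concat`, while the clamped run follows `addition` even though the input never changed. That before/after difference is the kind of evidence a clamping experiment relies on to argue for a causal role.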

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.