finding
active
finding:clamping-golden-gate-bridge-feature-to-10x-max-activation-caused-the-model-to-self-identify-as-the-golden-gate-bridge

Clamping Golden Gate Bridge feature to 10x max activation caused the model to self-identify as the Golden Gate Bridge.

Strong causal evidence that the feature represents the bridge.

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.