claim
active
claim:trainable-intervention-das-finds-sparser-gender-representations-than-linear-probing-suggesting-probing-overestimates-causal-coverage

Trainable intervention (DAS) finds sparser gender representations than linear probing, suggesting probing overestimates causal coverage

Interpretive claim from Case Study II about the distinction between correlational probes and causal interventions

Source paper

extracted_from
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions
(2024) · Zhengxuan Wu · Atticus Geiger · Aryaman Arora · Jing Huang +4

Neighborhood — ranked by edge-count

Findings (1)

finding

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Restated by (1)

cosine ≥ 0.90

Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.