question
active
question:why-does-l2-regularisation-increase-probe-causal-efficacy-selectivityWhy does L2 regularisation increase probe causal efficacy (selectivity)?
Open question identified in hyperparameter tuning experiments, left for future work
Source paper
extracted_from(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (1)
finding
- Hyperparameter tuning result for probes; consistent with Hewitt and Liang 2019 finding
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Question raised by the discrepancy between DAS IIA and linear probe accuracy in Case Study II
- Authors' central interpretive assertion that their method meaningfully mitigates unwanted behaviors.
- Key interpretive claim from Case Study II distinguishing probe accuracy from causal relevance
- Gradient-based attribution approximates ablation impact, enabling fast search for causally important features.
- Core result showing MM is superior to LR for causal implication despite similar classification accuracy
- Justification for the novel metric introduced in the paper
- Supported by the finding that non-trivial rotations are required to find aligned representations.
- Open question raised in §7.1 about an unexplained anomalous result