finding
active
finding:l2-regularisation-with-bias-term-delivers-best-probe-performance-l2-regularisation-increases-probe-selectivityL2 regularisation with bias term delivers best probe performance; L2 regularisation increases probe selectivity
Hyperparameter tuning result for probes; consistent with Hewitt and Liang 2019 finding
Source paper
extracted_from(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
Neighborhood — ranked by edge-count
Questions (1)
question
- Open question identified in hyperparameter tuning experiments, left for future work
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Larger models linearly represent more general concepts including truth
- Interpretation of low KL divergence results as validation of the training objective
- Implication of PRH for AI fairness and bias
- Key interpretive claim from Case Study II distinguishing probe accuracy from causal relevance
- Justification for the novel metric introduced in the paper
- We hypothesize that degraded generalization on benchmarks like MMLU may reflect the computational demands of the tasks.hypothesis0.745Connecting the paper's task-difficulty findings to prior observations of weak generalization on complex QA benchmarks.
- Authors' claim that their approach is both more effective in reduction and cheaper than prior methods.
- Justifies restricting probe-based vector derivation to h_b activations; attributed to Yes/No semantics