finding
active
finding:das-achieves-overall-odds-ratio-of-10-24-on-pythia-410m-averaged-across-all-causalgym-tasks

DAS achieves overall odds-ratio of 10.24 on pythia-410m averaged across all CausalGym tasks

Numerical result for pythia-410m

Source paper

extracted_from
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
