finding
active
finding:with-only-1-000-training-samples-nonlin-achieves-iia-over-0-99-on-training-set-for-identity-of-first-argument-algorithm-but-fails-at-scale

With only 1,000 training samples, ϕ_nonlin achieves an IIA above 0.99 on the training set for the identity-of-first-argument algorithm, but fails at scale

Confirms that the theorem's existence proof holds, but that practical learnability fails when RevNet capacity is insufficient
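
For readers unfamiliar with the metric, interchange intervention accuracy (IIA) is the fraction of interchange interventions on which the model's output matches the counterfactual output predicted by the high-level algorithm. The sketch below illustrates only that ratio. The function name and the toy label lists are hypothetical and are not taken from the source paper's code or data.

```python
from typing import Sequence


def interchange_intervention_accuracy(
    model_outputs: Sequence[int],
    expected_outputs: Sequence[int],
) -> float:
    """Fraction of interventions where the model matches the algorithm.

    model_outputs: labels the model produced after each interchange
        intervention (hypothetical toy data below).
    expected_outputs: labels the high-level algorithm predicts for the
        same interventions.
    """
    if len(model_outputs) != len(expected_outputs):
        raise ValueError("both sequences must have the same length")
    if len(model_outputs) == 0:
        raise ValueError("at least one intervention is required")
    matches = sum(int(m == e) for m, e in zip(model_outputs, expected_outputs))
    return matches / len(model_outputs)


# Toy example: 3 of 4 interventions agree with the algorithm.
model_outputs = [1, 0, 1, 1]
expected_outputs = [1, 0, 0, 1]
iia = interchange_intervention_accuracy(model_outputs, expected_outputs)
print(iia)
```

A training-set IIA above 0.99, as reported in this finding, would correspond to fewer than 1 mismatch per 100 interventions under this definition.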

Source paper

extracted_from
The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?
(2025) · Sutter, Denis · Minder, Julian · Hofmann, Thomas · Pimentel, Tiago
