finding
active
finding:learned-rotation-matrices-are-non-trivial-majority-of-basis-vectors-are-rotated-indicating-highly-distributed-representations

Learned rotation matrices are non-trivial: the majority of basis vectors are rotated, indicating highly distributed representations

Because the learned rotations move most basis vectors away from the standard axes, probes that read directly from the standard activation basis would miss the actual causal role of these representations.
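One way to make "non-trivial rotation" concrete is to count how many columns of a rotation matrix no longer line up with any single standard axis. The sketch below is an illustration only, not the paper's procedure: the function names `rotated_fraction` and `random_orthogonal`, the 0.9 alignment threshold, and the 16-dimensional random matrix standing in for a learned rotation are all assumptions made for the example.

```python
import numpy as np


def rotated_fraction(R, tol=0.9):
    """Fraction of columns of R that are not aligned with a standard axis.

    A column counts as 'rotated' when its largest absolute coordinate is
    below `tol` (a hypothetical threshold chosen for illustration).
    """
    R = np.asarray(R, dtype=float)
    # Each column is one basis vector written in the standard basis.
    peak = np.abs(R).max(axis=0)
    return float(np.mean(peak < tol))


def random_orthogonal(n, seed=0):
    """Stand-in for a learned rotation: an orthogonal matrix from a QR step."""
    rng = np.random.default_rng(seed)
    Q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return Q


# Trivial cases: identity and a pure permutation keep every vector on an axis.
identity_frac = rotated_fraction(np.eye(16))
perm_frac = rotated_fraction(np.eye(16)[:, ::-1])

# Dense orthogonal matrix: coordinates are spread over many axes.
dense_frac = rotated_fraction(random_orthogonal(16))

print(identity_frac, perm_frac, dense_frac)
```

The identity and the permutation both score 0.0, since every column still points along one axis, while a dense orthogonal matrix is expected to score close to 1.0. A high score on a learned rotation would be consistent with the finding that the relevant information is spread across many neurons rather than readable from single standard-basis directions.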

Source paper

extracted_from
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
