question
active
question:can-an-interpretable-symbolic-algorithm-be-used-to-faithfully-explain-a-complex-neural-network-model

Can an interpretable symbolic algorithm be used to faithfully explain a complex neural network model?

Framing question for the paper's research program.

Source paper

extracted_from
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.