quote
active
quote:for-interpretability-i-don-t-think-we-even-have-the-right-definitions

"For interpretability, I don't think we even have the right definitions."

Ian Goodfellow quote used to illustrate the pre-paradigmatic state of interpretability research

Source paper

extracted_from
Zoom In: An Introduction to Circuits
(2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.