quote
active
quote:similar-to-how-a-programmer-might-try-to-reverse-engineer-complicated-binaries-into-human-readable-source-code

Similar to how a programmer might try to reverse engineer complicated binaries into human-readable source code.

Load-bearing analogy defining the aspirational goal of mechanistic interpretability

Source paper

extracted_from
A Mathematical Framework for Transformer Circuits
(2021) ·

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.