hypothesis
active
hypothesis:we-hypothesize-that-polysemantic-neurons-may-be-resolvable-by-unfolding-networks-or-training-to-avoid-polysemanticity

We hypothesize that polysemantic neurons may be resolvable by unfolding networks or training to avoid polysemanticity.

Forward-looking proposal for how the polysemanticity challenge to circuits research might be overcome

Source paper

extracted_from
Zoom In: An Introduction to Circuits
(2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.