method
active
method:feature-neighborhood-exploration-via-cosine-similarity-of-decoder-weights

Feature neighborhood exploration via cosine similarity of decoder weights

Identifying related features by cosine distance in SAE decoder space.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.