claim
active
claim:dictionary-learning-offers-advantages-over-linear-probes-amortization-of-cost-and-unsupervised-discovery-of-abstractions

Dictionary learning offers advantages over linear probes: amortization of cost and unsupervised discovery of abstractions.

SAE features can be found without pre-specified concepts, and feature steering often outperforms few-shot probe vectors.

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.