question
active
question:as-the-subject-model-scales-how-does-the-ideal-expansion-factor-and-required-training-data-for-dictionary-learning-change

as the subject model scales, how does the ideal expansion factor and required training data for dictionary learning change?

Scaling laws for dictionary learning are unknown and needed to assess feasibility on frontier models

Source paper

extracted_from
Towards Safe and Honest AI Agents with Neural Self-Other Overlap
(2024) · Marc Carauleanu · Michael Vaiana · Judd Rosenblatt · Cameron Berg +1

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.