finding
active
finding:sdf-training-used-115-6-million-tokens-rank-64-lora-learning-rate-1e-4SDF training used 115.6 million tokens (rank-64 LoRA, learning rate 1e-4)
Training details for first stage.
Source paper
extracted_from(2025) · Hua, Tim Tian · Qin, Andrew · Marks, Samuel · Nanda, Neel
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Training scale for second stage.
- Token usage varies roughly 20× across models, from ~14,800 (G3.1-FL) to ~275,000 (G3-F) per gamefinding0.723Reasoning verbosity does not predict strategic strength: both top and weak models span a wide range of token usage.
- DAS learning rate of 5e-3 outperforms 1e-3 (used in Wu et al. 2023) for small training sets in CausalGymfinding0.700Hyperparameter tuning result for DAS; different from prior work due to smaller training set size
- The specific SAE architecture trained: 100K+ possible features compressed to 64 active per token for layer-40 activations
- Basic SAE performance metrics.
- We hypothesize that a very high number of training tokens may allow the transformer to learn cleaner representations in superpositionhypothesis0.691Motivation for heavily overtraining the one-layer transformer on 100 billion tokens
- Ethical implication about the nature of AI training experience if the thesis holds
- verbose reasoning not required for strong play