method
active
method:adam-optimizerAdam Optimizer
Used to optimize the policy and value networks
Neighborhood — ranked by edge-count
Papers (1)
paper
Methods (1)
method
- AdamW Optimizerrelated_toUsed to optimize the world model and self-prior
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A learned optimizer running inside a base optimizer; transformers proposed as mesa-optimizers implementing gradient descent in-context
- Second training stage: samples responses, filters for type hints, and fine-tunes on filtered responses across four rounds to reinforce evaluation behavior.
- Light-gated ion channels used to control bioelectric states and dissect cellular computation.
- Framework for optimizing multiple objectives simultaneously, used in MTL.
- The drive to reduce expected ambiguity about outcomes given states, leading to seeking well-lit, informative environments.
- The progressive reduction of error (stress) as cells move toward their target positions.
- Second model system studied; used to show why flat autoregressive LLMs struggle with long-range coherence.
- Interpretability framework used to decompose layer-40 activations into sparse feature sets for studying emotional alignment and persistence