method
active
method:autoregressive-samplingAutoregressive Sampling
The mechanism by which LLMs generate text: drawing a token from the next-token distribution and appending it to context repeatedly
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Large Language Models (LLMs)implementsTransformer-based models like GPT-4, LaMDA, PaLM; assessed for GWT indicators.
Methods (1)
method
- autoregressive modelingrelated_toStatistical technique where outputs are regressed on previous values; used in language generation
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Second model system studied; used to show why flat autoregressive LLMs struggle with long-range coherence.
- Transformers are recurrent through autoregression because the K/V stream provides horizontal information flow across positions, even though each forward pass is feedforward.
- The training parallelization technique that latent methods are difficult to train with.
- Baseline persistence of any probe direction arising from the autoregressive nature of LLMs, not specific to emotion content
- Training objective interpretable as optimizing a diverse set of tasks; thus subject to multitask scaling convergence pressures
- A Bayesian exploration strategy that samples from the posterior distribution over model parameters to decide actions.
- A technique to filter model outputs; Redwood Research's project mentioned.
- Dividing feature activation spectrum into 11 evenly-spaced intervals and sampling uniformly to evaluate monosemanticity across activation levels