concept
active
concept:autoregressive-language-modeling

Autoregressive Language Modeling

Training objective (next-token prediction) that can be interpreted as jointly optimizing a diverse set of tasks, and is therefore subject to the convergence pressures that come with multitask scaling
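
As a sketch of this reading, the objective can be written as the usual next-token negative log-likelihood. The notation is assumed here for illustration and is not taken from the source: x = (x_1, ..., x_T) is a token sequence, θ the model parameters, and D the training distribution. If D is further assumed to be a mixture of K hypothetical task or domain distributions D_k with weights π_k, linearity of expectation splits the same loss into a weighted sum of per-task losses, which is one way to make the "diverse set of tasks" interpretation concrete:

```latex
\mathcal{L}(\theta)
  = -\,\mathbb{E}_{x \sim \mathcal{D}}
      \left[ \sum_{t=1}^{T} \log p_\theta\!\left(x_t \mid x_{<t}\right) \right],
\qquad
\mathcal{D} = \sum_{k=1}^{K} \pi_k \, \mathcal{D}_k
\;\Longrightarrow\;
\mathcal{L}(\theta) = \sum_{k=1}^{K} \pi_k \, \mathcal{L}_k(\theta),
\quad
\mathcal{L}_k(\theta)
  = -\,\mathbb{E}_{x \sim \mathcal{D}_k}
      \left[ \sum_{t=1}^{T} \log p_\theta\!\left(x_t \mid x_{<t}\right) \right].
```

The mixture decomposition is only an illustrative assumption; real corpora do not come with a labeled partition into tasks.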

Neighborhood — ranked by edge-count

Hypotheses (1)

hypothesis
  • Argues that fewer representations are competent for N tasks than for M < N tasks, so more general models have a smaller solution space
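
The counting argument in this hypothesis can be sketched with sets, under two assumptions that the source does not state: the M tasks are a subset of the N tasks, and R_i denotes the (hypothetical) set of representations competent for task i. The set S_N of representations competent for all N tasks is then an intersection that can only shrink as tasks are added:

```latex
S_N = \bigcap_{i=1}^{N} R_i
    = S_M \cap \bigcap_{i=M+1}^{N} R_i
    \;\subseteq\; S_M
    = \bigcap_{i=1}^{M} R_i ,
\qquad M < N .
```

Under these assumptions the inclusion is strict, giving a genuinely smaller solution space, only when at least one of the additional tasks rules out some representation in S_M.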

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.