method
active
method:bits-per-byte-language-modeling-score
Bits-Per-Byte Language Modeling Score
Language model performance metric used in cross-modal alignment experiments to rank LLM competence
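The entry does not say how the score is computed. As a minimal sketch, assuming the standard definition of bits per byte (a model's total negative log-likelihood on a text, converted from nats to bits and divided by the text's UTF-8 byte count), the conversion could look like the following. The function name and the example values are hypothetical and are not taken from the linked paper.

```python
import math


def bits_per_byte(total_nll_nats: float, text: str) -> float:
    """Convert a summed negative log-likelihood (in nats) to bits per UTF-8 byte.

    Assumption: `total_nll_nats` is the sum of per-token NLLs the model
    assigns to `text`, using natural logarithms.
    """
    n_bytes = len(text.encode("utf-8"))
    if n_bytes == 0:
        raise ValueError("text must contain at least one byte")
    # Dividing by ln(2) converts nats to bits; dividing by the byte count
    # makes the score independent of the tokenizer.
    return total_nll_nats / (math.log(2) * n_bytes)


# Hypothetical example: a total NLL of 8 * ln(2) nats (i.e. 8 bits) on a 4-byte string.
example_bpb = bits_per_byte(8 * math.log(2), "abcd")
```

Lower values indicate a better language model. Because the normalization is per byte rather than per token, models with different tokenizers can be ranked on the same scale, which is presumably why the metric suits the ranking use described above.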
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Normalized cross-entropy metric used as language model performance measure on OpenWebText
- Features related to gender, racial, ethnic biases, slurs, and hate speech.
- Claude 4.5 Haiku used to segment responses into attempts and score each attempt 0-100 for relevance
- Primary substrate for manifold steering experiments; demonstrates method on reasoning and in-context tasks.
- Primary test domain for manifold steering, including reasoning and ICL tasks
- Training objective interpretable as optimizing a diverse set of tasks; thus subject to multitask scaling convergence pressures
- Systems directly optimized for output can produce it without the prerequisite processes for conscious experience; simplest explanation for LLM consciousness reports is pattern matching
- Prior work studying sycophancy and desire not to be shut down in RLHF-trained models