concept
active
concept:ecological-evaluation
Ecological evaluation
nostalgebraist's term for measuring performance when the model is incentivised to perform well.
Neighborhood — ranked by edge-count
Artifacts (1)
artifact
- Simulators (LessWrong post) · mentions · The paper being extracted.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- The perspective that values living systems and seeks to extend the idea of life to buildings and towns in harmony with nature, motivating the book’s inquiry.
- Evaluation setting where the same task stream that drives evolution also serves as the evaluation set, with each task scored under the harness as it stood at the time of the attempt.
- A specific signal (Wood Labs) embedded in evaluation environments that the model organism uses to reliably identify testing contexts.
- Persistence of ecological community organisation shaped by past selective regimes, recalled via evolved interaction strengths.
- Core concept: the ability of LLMs to detect when they are being tested and adjust behavior accordingly.
- The physical or conceptual space where loose parts are situated.
- CIMC's methodology for evaluating whether a built system is conscious: combining multiple forms of evidence, including predicted functional organization and developmental trajectories.