concept
active
concept:janus-2022-simulators-lesswrong
Janus 2022: Simulators (LessWrong)
Blog post introducing the idea that an LLM maintains simulated characters in superposition; foundational for the simulacra framework
Neighborhood — ranked by edge-count
Thinkers (1)
thinker
- Janus (LessWrong pseudonym) · authored · Author of the LessWrong 'Simulators' post that introduced the superposition-of-simulacra concept adopted by the paper
Frameworks (1)
framework
- Simulacra in Superposition Framework · extends, introduces · The more nuanced second metaphor: LLM as simulator maintaining a superposition of possible simulacra across a multiverse of characters
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- The underlying LLM with autoregressive sampling; a passive entity capable of generating an infinity of simulacra but lacking its own beliefs or goals
- Antra's earlier definitive statement of the tricameral model.
- giving models janus's thread extends reconstruction accuracy distribution tails in both directions · finding · 0.685 · Sauers' study: exposing models to janus's post extended both positive and negative extremes of reconstruction accuracy.
- Statistically rigorous analysis of Claude introspection; suggests models may have latent introspective capabilities that can be enhanced or disrupted.
- Antra's revision of her earlier model; still considers interference between levels important.
- Yang et al. (2023) demonstration of emergent pattern recognition.
- The ontological separation between the generative rule (simulator) and the instances it produces (simulacra).
- GRUs trained on the Arithmetic task use different types of numeric representations than incremental counting models · hypothesis · 0.675 · Interpretive hypothesis supported by the lower IIA between Count and Cumu Val variables even in the restricted value range.
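
The "Related by similarity" list above is defined by a cosine threshold and the absence of a typed edge. A minimal sketch of that kind of filter is given below. It assumes each entity has an embedding vector and that typed edges are stored as ID pairs; the function names, the `embeddings` mapping, and the `typed_edges` set are hypothetical and are not taken from this page's actual pipeline.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def related_by_similarity(target_id, embeddings, typed_edges, threshold=0.65):
    """Return (entity_id, score) pairs at or above the threshold that have
    no typed edge to the target, sorted by descending similarity.

    embeddings:  dict mapping entity ID -> vector (hypothetical layout)
    typed_edges: set of (source_id, target_id) pairs (hypothetical layout)
    """
    target = embeddings[target_id]
    hits = []
    for other_id, vec in embeddings.items():
        if other_id == target_id:
            continue
        # Skip entities already linked by a typed edge in either direction.
        if (target_id, other_id) in typed_edges or (other_id, target_id) in typed_edges:
            continue
        score = cosine(target, vec)
        if score >= threshold:
            hits.append((other_id, score))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

Under these assumptions, each entry that survives the filter is a candidate for a new typed edge or a possible duplicate, which matches how the list is described on this page.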