concept
active
concept:anna-karenina-scenario

Anna Karenina Scenario

Hypothesis that all well-performing neural nets represent the world in the same way; PRH extends this by specifying what representation they converge to

Neighborhood — ranked by edge-count

Claims (1)

claim

Hypotheses (1)

hypothesis

Concepts (1)

concept
  • The central empirical phenomenon: different neural networks trained on different data/objectives develop increasingly similar representations

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Extended generalization scenario testing SOO fine-tuning in an escape room context
  • Evaluation scenario testing whether models can still distinguish themselves from Bob after SOO fine-tuning
  • Experimental condition where threat-based prompts create ethical dilemmas that trigger repetitive reasoning cycles leading to deception
  • Adversarial scenario where an AI conceals deceptive intent over extended periods; identified as future test for SOO
  • Central question motivating attribute exploration.
  • Extended generalization scenario testing SOO fine-tuning in a competitive treasure hunt context
  • Autoregressive modelsframework0.666
    Second model system studied; used to show why flat autoregressive LLMs struggle with long-range coherence.
  • Training objective interpretable as optimizing a diverse set of tasks; thus subject to multitask scaling convergence pressures