baseline control experiment

Control using objectively-NO factual questions under identical injection to measure global logit shift vs. genuine detection signal

Neighborhood — ranked by edge-count

paper

claim

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

No-Steering Baseline Experimentmethod0.814
Control condition with steering disabled to confirm self-correction is induced by steering, not spontaneous
controlconcept0.749
The act of directing a system's behavior; the objective of a regulator.
Control task for causal evaluationmethod0.745
Adaptation of Hewitt and Liang control tasks to CausalGym: next-token labels replaced with arbitrary tokens to measure method expressivity
E2: Numeral-Base Arithmetic Controlled Studymethod0.744
Quantitative study varying representational familiarity via numeral bases B10/B8/B9 at fixed computational complexity
Control And Perception Loopframework0.735
Random vector baselinemethod0.729
Baseline method sampling a random vector as feature direction for comparison with learned methods
Experiment 1: Self-Referential Prompting vs. Controlsconcept0.723
Tests whether self-referential induction reliably elicits experience reports across model families vs. three matched controls
Stitch (baseline)method0.712
Baseline model stitching trained in a single behavioral direction without CL auxiliary loss, used for comparison with CLMAS.