dev set calibration

Fixed dev pool of 1000 prompts used for whitening and z-scoring parameters.

Neighborhood — ranked by edge-count

method

whitening and z-scoring procedure
implements
Calibration protocol: whiten embeddings on dev pool, z-score ρd and dr per layer.

dataset

calibration dev pool
about
1000 mixed prompts used to compute whitening mean/covariance and z-scoring statistics.

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

S is a predictive correlate calibrated on dev sets, not an absolute measureclaim0.760
Clarifies nature of S.
Dev-Set Whitening and Z-Scoring Protocolmethod0.710
Preprocessing pipeline for standardizing ρd, dr, and S across layers/models using dev-set covariance
Accuracy Criterionconcept0.709
Criterion requiring that model's description of internal state be accurate, distinguishing genuine introspection from confabulation.
Anchor Calibration Responsesconcept0.709
Three reference responses at known quality levels shown alongside each target to eliminate score inflation in calibrated rubric scoring
Calibrated Few-Shot Promptingmethod0.702
Baseline method: sweeps over shot count and resamples prompts; calibrates threshold for P(TRUE)-P(FALSE); performed surprisingly weakly
SEAL (Steerable Reasoning Calibration)framework0.694
Prior work using steering vectors to control reflection, motivated by reducing redundant self-reflection in long CoT.
Architectural form should be calibrated with forces of environment through artificial design methods, extending natural processes of interaction between specific quantifiable forces.claim0.693
Alexander's structuralist approach treating design as homeostatic adaptation analogous to biological systems.
Fine-Tuning Threshold Recalibrationmethod0.692
Re-running probabilistic bisection on each fine-tuned checkpoint to normalize first-attempt difficulty