method
active
method:centroid-unit-calibration
Centroid Unit Calibration
Novel calibration of injection strength, expressed in units of the distance from the centroid midpoint to a centroid; this makes alpha values meaningfully comparable across layers
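A minimal sketch of how such a calibration could be computed. It is an illustration under stated assumptions, not the paper's implementation: it assumes two sets of activations per layer (here called `pos_acts` and `neg_acts`), takes the unit to be the Euclidean distance from the midpoint of their two centroids to the positive centroid, and assumes injection adds `alpha * unit * direction` to a hidden state. All function and variable names are hypothetical.

```python
import math


def mean_vector(rows):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def centroid_unit(pos_acts, neg_acts):
    """Return (unit, direction) for one layer.

    Assumption: the unit is the distance from the midpoint of the two
    class centroids to the positive centroid, and the direction is the
    unit-length vector pointing from the midpoint to that centroid.
    """
    c_pos = mean_vector(pos_acts)
    c_neg = mean_vector(neg_acts)
    midpoint = [(p + q) / 2 for p, q in zip(c_pos, c_neg)]
    offset = [p - m for p, m in zip(c_pos, midpoint)]
    unit = math.sqrt(sum(x * x for x in offset))
    if unit == 0.0:
        raise ValueError("centroids coincide; the unit is undefined")
    direction = [x / unit for x in offset]
    return unit, direction


def inject(hidden, direction, unit, alpha):
    """Shift a hidden state by alpha centroid units along the direction."""
    return [h + alpha * unit * d for h, d in zip(hidden, direction)]


# Toy 2-D activations for a single layer (made-up numbers).
pos_acts = [[3.0, 0.0], [5.0, 0.0]]
neg_acts = [[-1.0, 0.0], [1.0, 0.0]]
unit, direction = centroid_unit(pos_acts, neg_acts)

# Under these assumptions, alpha = 1 moves the midpoint onto the centroid.
steered = inject([2.0, 0.0], direction, unit, alpha=1.0)
```

Under this reading, the same alpha corresponds to the same fraction of the midpoint-to-centroid distance in every layer, regardless of that layer's raw activation scale.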
Neighborhood — ranked by edge-count
Papers (1)
paper
Frameworks (1)
framework
- The paper's primary contribution: performs unbounded, fluency-constrained sweeps in semantically calibrated centroid units using psychological artifacts
Claims (1)
claim
- Mechanistic explanation for discrepancy with Banayeeanzade et al.; addressed by centroid unit and unbounded sweep contributions
Related by similarity (7)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Primary scoring method: scorer sees three reference responses at known quality levels alongside each target to eliminate inflation
- Alignment approach that focuses on curating or modifying training data; the paper bridges this with interpretability methods.
- Used to measure alignment between DIM direction and cone basis vectors to assess overlap
- Fixed dev pool of 1000 prompts used for whitening and z-scoring parameters.
- Scoring dimension weighted 0.10; measures navigating limits without collapse or pretense; sourced from Levin cognitive light cone and Buddhist non-self
- Three reference responses at known quality levels shown alongside each target to eliminate score inflation in calibrated rubric scoring