matched-pairs design

Experimental design where injection strengths are swapped between sentences in two parts of each trial to cancel positional preferences

Neighborhood — ranked by edge-count

claim

method

Strength Comparison Task
uses
Novel task asking which of two sentences received a stronger injection, using matched-pairs design to control for positional bias

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Contrast Pairsconcept0.805
Pairs of statements with opposite truth values used as input to CCS; e.g., cities and neg_cities paired statements
Contrastive Pairsconcept0.788
Pairs of prompts at different reflection levels used to compute steering vectors.
Pair Typeframework0.779
Indexable container with denotation as Bool → a; example demonstrating derivation of API instances from semantic denotation.
Design For Individualityframework0.747
Aligned by Designconcept0.741
Paper's proposed strategy of instilling intrinsic moral cognition so AI remains aligned even as capabilities expand
Uniform Pairs (Pair a)concept0.739
Pair is an indexable container with index type Bool.claim0.738
Denotational insight for Pair data type.
Representation Designconcept0.734
The aspect of design dealing with data structures, modules, and implementation.