representational mismatch dr

Distance between prior and target representations.

Neighborhood — ranked by edge-count

concept

Mismatch dr
related_to
Distance between prior knowledge centroid and target pattern centroid, e.g., 1 - cos(eprior, eT).
anchoring strength S
associated_with
Composite score S = ρd − dr − log k predicting anchoring success.
shot midpoint k50
associated_with
Number of in-context exemplars to reach 50% accuracy in E2.
representational drift
associated_with
Accumulation of mismatch in later layers causing S degradation.

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Prior-Target Mismatch (dr)concept0.783
Measures how far the target PT is from the prior P_prior; increases anchoring difficulty
Representational Alignmentconcept0.779
Measure of similarity between the similarity structures (kernels) induced by two different representations
Representational familiarityconcept0.777
How familiar a model is with a numeral system, manipulated via bases in Experiment 2.
Mismatch Negativityfinding0.761
ERP component reproduced by active inference: neural response to prediction violations.
Representation-computation mismatch (Feucht: cyclic concepts computed in base-10, not circular geometry) will force About Blank to decouple 'representational geometry' from 'operational structure' in prediction0.760
Representational Divergenceconcept0.749
Core phenomenon studied: when causal interventions shift internal representations away from the natural distribution
Model Misalignmentconcept0.748
The phenomenon of model internals deviating from desired behavior; MAS is demonstrated to detect this via comparison of toxic vs nontoxic LLMs.
Representational Isomorphismconcept0.743
The desired property of a bidirectional, behavior-preserving mapping between model representations; the goal MAS pursues.