concept
active
concept:representational-mismatch-drrepresentational mismatch dr
Distance between prior and target representations.
Neighborhood — ranked by edge-count
Concepts (4)
concept
- Mismatch drrelated_toDistance between prior knowledge centroid and target pattern centroid, e.g., 1 - cos(eprior, eT).
- anchoring strength Sassociated_withComposite score S = ρd − dr − log k predicting anchoring success.
- shot midpoint k50associated_withNumber of in-context exemplars to reach 50% accuracy in E2.
- representational driftassociated_withAccumulation of mismatch in later layers causing S degradation.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Measures how far the target PT is from the prior P_prior; increases anchoring difficulty
- Measure of similarity between the similarity structures (kernels) induced by two different representations
- How familiar a model is with a numeral system, manipulated via bases in Experiment 2.
- ERP component reproduced by active inference: neural response to prediction violations.
- Core phenomenon studied: when causal interventions shift internal representations away from the natural distribution
- The phenomenon of model internals deviating from desired behavior; MAS is demonstrated to detect this via comparison of toxic vs nontoxic LLMs.
- The desired property of a bidirectional, behavior-preserving mapping between model representations; the goal MAS pursues.