concept
active
concept:representational-divergence
Representational Divergence
Core phenomenon studied: causal interventions shift a model's internal representations away from their natural distribution
Neighborhood — ranked by edge-count
Papers (1)
paper
Methods (1)
method
- Primary quantitative measure of distributional divergence between natural and intervened representations
Concepts (1)
concept
- Natural Distribution of Representations · associated_with · The distribution of latent representations produced by the model under unperturbed inputs
Artifacts (1)
artifact
- Public code repository for reproducing the experiments in this paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- The central empirical phenomenon: different neural networks trained on different data/objectives develop increasingly similar representations
- A measure of the difference between two probability distributions, used extensively in free energy formulations.
- Accumulation of mismatch in later layers causing S degradation.
- Divergences that activate hidden pathways or cause dormant behavioral changes, undermining mechanistic claims
- Divergences that occur in the behavioral null-space and do not affect functional claims about the model
- Practical utility of reducing divergence demonstrated through regression analysis
- CIMC's characterization of part of the solution to the Hard Problem: insight into the structural necessities of phenomenal representation
- What has led to representational convergence, will it continue, and ultimately where does it end? · question · 0.790 · Central motivating questions of the paper
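The selection rule behind this list (cosine ≥ 0.65, no typed edge) can be sketched roughly as follows. This is a minimal illustration, not the graph's actual implementation: the function names, the dictionary of embeddings, and the edge-pair format are all hypothetical, and only the 0.65 threshold comes from the page itself.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def related_by_similarity(entity_id, embeddings, typed_edges, threshold=0.65):
    """Return (entity, score) pairs that are similar to entity_id but not linked to it.

    embeddings:  hypothetical dict mapping entity id -> embedding vector
    typed_edges: hypothetical iterable of (source_id, target_id) pairs
    threshold:   minimum cosine similarity (0.65, as stated on the page)
    """
    # Entities that already share a typed edge with entity_id, in either direction.
    linked = {b if a == entity_id else a
              for a, b in typed_edges if entity_id in (a, b)}
    target = embeddings[entity_id]
    hits = []
    for other, vec in embeddings.items():
        if other == entity_id or other in linked:
            continue
        score = cosine(target, vec)
        if score >= threshold:
            hits.append((other, score))
    # Most similar first.
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

Entities returned by such a filter would then be reviewed by hand, since a high score may indicate either a missing edge or a duplicate entry.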