method
active
method:mutual-k-nearest-neighbor-alignment-metricMutual k-Nearest Neighbor Alignment Metric
Primary alignment metric used in experiments; measures mean intersection of k-nearest neighbor sets between two kernels
Neighborhood — ranked by edge-count
Papers (1)
paper
- The Platonic Representation Hypothesisintroducesuses
Findings (2)
finding
- Shows cross-modal alignment is primarily local rather than global
- Validates robustness of alignment metric choice
Concepts (1)
concept
- Representational AlignmentimplementsMeasure of similarity between the similarity structures (kernels) induced by two different representations
Methods (2)
method
- Centered Kernel AlignmentextendsStandard alignment metric cited and compared against; measures global kernel similarity between representations
- Modified CKA metric that restricts cross-covariance to nearest neighbors; introduced in this paper's appendix
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Open question the authors leave unresolved about interpreting the magnitude of their alignment measurements
- Open methodological question acknowledged as limitation
- The theoretical cap on cross-modal alignment determined by mutual information between input signals and model capacity
- The bijective function mapping DNN inner neurons to latent variables in causal abstraction; its complexity is the central variable studied
- The goal of making model behavior match human values and intentions, often addressed during post-training.
- Alignment approach that focuses on curating or modifying training data; the paper bridges this with interpretability methods.
- Quantitative bound on observed alignment; raises the open question of whether this gap reflects noise or real misalignment
- Baseline method that exhaustively searches discrete spaces of localist alignments between high-level variables and neuron groups.