concept
active
concept:zero-shot-model-stitchingZero-Shot Model Stitching
Model stitching without learning a stitching layer, demonstrating strong alignment across different model training regimes
Neighborhood — ranked by edge-count
Claims (1)
claim
- Strong evidence for representational alignment across models
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Moschella et al. result cited as evidence of representational convergence across models
- Technique to measure representational compatibility by integrating intermediate representations of one model into another
- Ability to predict correctly for stimulus-action pairs never previously experienced by inferring structural rules; key measure for TEM-t performance.
- Formal analysis showing the theoretical limitation of model stitching as a similarity measure.
- Prediction without task-specific training; Evee achieves 0.991 AUROC on indels in zero-shot mode.
- Baseline model stitching trained in a single behavioral direction without CL auxiliary loss, used for comparison with CLMAS.
- Baseline method using a single orthogonal matrix trained to map source latents to target latents via CL auxiliary loss without behavioral objective.
- Control omitting any induction and presenting only the final experiential query