finding
active
finding:phi-4-shows-u-shaped-cohesion-with-falling-mismatch-peak-depth-varies-by-modelPhi-4 shows U-shaped cohesion with falling mismatch; peak depth varies by model
E3 backbone-specific finding showing three-stage trajectory generalizes across architectures
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Three-Stage Layer TrajectorysupportsEmpirically observed pattern in E3: early enrichment (ρd dips), mid-layer alignment (dr falls), late standardization (re-clustering)
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Supported by the geometric transition visible in cosine similarity heatmaps for F0-F3.
- Geometric evidence for convergence to stable truth directions only for simpler tasks.
- Reflection-inducing directions emerge more clearly in higher layers (ℓ>5) for both models and datasetsfinding0.754Empirical observation about which network layers encode reflection-relevant information.
- Heavy alignment includes both CAI (low lift) and heavy-RLHF (high lift); predictor is alignment type not depth
- Shot midpoints follow k50 ∝ dr/ρd; higher cohesion and lower mismatch yield fewer required examplesclaim0.751Core quantitative prediction of UCCT validated by E2 threshold ordering
- Truth-related directions reliably emerge at 60–75% of normalized layer depth in Qwen and Gemma modelsfinding0.750Experiment 1 finding localizing where truth can be causally mediated
- Demonstrates that early-layer probes capture sentence polarity rather than truth.
- Gemma-3-4B-it shows three-stage layer trajectory and S(ℓ) peak despite scale differences in dr and ρdfinding0.746E3 backbone generalization finding for Gemma; validates pattern across diverse architectures