finding
active
finding:gemma-3-4b-it-shows-three-stage-layer-trajectory-and-s-l-peak-despite-scale-differences-in-dr-and-dGemma-3-4B-it shows three-stage layer trajectory and S(ℓ) peak despite scale differences in dr and ρd
E3 backbone generalization finding for Gemma; validates pattern across diverse architectures
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Three-Stage Layer TrajectorysupportsEmpirically observed pattern in E3: early enrichment (ρd dips), mid-layer alignment (dr falls), late standardization (re-clustering)
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Geometric evidence for convergence to stable truth directions only for simpler tasks.
- Experiment 4 result showing DIM captures only one facet of the multi-dimensional truth subspace
- Truth-related directions reliably emerge at 60–75% of normalized layer depth in Qwen and Gemma modelsfinding0.773Experiment 1 finding localizing where truth can be causally mediated
- Weaker cross-family probe; explains weaker introspection in Gemma
- Do layer-wise geometric signatures (τ_peak, AUS_N) correlate with behavioral thresholds (k50)?question0.762E3 research question testing whether internal representations provide a geometry-to-behavior bridge
- Gemma-2-27B attention layer Latent SOO MSE reduced from 11 to 7.67 ± 0.77 after SOO fine-tuningfinding0.760SOO fine-tuning reduced attention layer MSE in Gemma-2-27B though MLP layers showed no significant change
- Establishes generalizability of the core difficulty-boundary finding across model families.
- SOO fine-tuning did not collapse Gemma-2-27B self-other distinction needed for perspective-taking