concept
active
concept:gemma-2-2b-it
Gemma-2-2B-it
Smallest Gemma model tested, showing near-zero ESR
Neighborhood — ranked by edge-count
Papers (2)
Concepts (5)
- Gemma-2-9B-it · related_to · Medium Gemma model tested, showing near-zero ESR
- Gemma-3-4B-it · related_to · Backbone model used in E3 robustness overlay
- gemma-3-1b-it · related_to · Only model where MDS injections largely failed; excluded from main analyses
- gemma-3-12b-it · related_to · 12B Gemma model tested; used for openness linearity visualization (Figure 6)
- gemma-3-27b-it · related_to · 27B Gemma model quantized to 4-bit NF4; tested in OCEAN benchmarks
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Small Gemma model shows severe ASR degradation at higher cone dimensions
- Model-specific difference in persona susceptibility
- Paper describing Gemma 2 model family used in this study
- SAEs trained on pretrained Gemma-2 models used for steering in Gemma family experiments
- Gemma-2-27B-it deceptive response rate reduced from 100% to 9.36% ± 7.09% after SOO fine-tuning · finding · 0.710 · Primary result showing SOO fine-tuning significantly reduces deception in Gemma-2-27B
- Gemma-3-4B-it shows three-stage layer trajectory and S(ℓ) peak despite scale differences in dr and ρd · finding · 0.694 · E3 backbone generalization finding for Gemma; validates pattern across diverse architectures
- SOO fine-tuning did not collapse Gemma-2-27B self-other distinction needed for perspective-taking
- Experiment 2 result showing large Gemma model supports high-dimensional truth cones