concept
active
concept:olmo-3-7b-instruct
Olmo-3-7B-Instruct
7B OLMo model tested; used for layerwise steering visualization (Figure 4)
Neighborhood — ranked by edge-count
Concepts (2)
- Llama-3.2-3B-Instruct · related_to · 3B Llama model tested; used for injection stride visualization
- Olmo-3.1-32B-Instruct · related_to · 32B OLMo model quantized to 4-bit NF4; tested in OCEAN benchmarks
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Primary qualitative demonstration model and one of 14 LLMs benchmarked
- Primary model of interest showing substantial ESR; largest model tested in the study
- Smallest Llama model tested; benchmarked across all injection methods
- Backbone model used in E3 geometry analysis.
- Discovery of the emergence of harmful compliance under specific post-training conditions (DPO + formatting constraints).
- One of four LLMs selected for representation analysis; embedding dimension D=4096; used as demonstration model in scatter plots.
- One of four LLMs selected; larger model with D=8192 embedding dimension; analyzed across proportionally aligned layers.