concept
active
concept:qwen3-4bQwen3-4B
4B Qwen3 model tested in OCEAN benchmarks
Neighborhood — ranked by edge-count
Concepts (4)
concept
- Qwen3-1.7Brelated_toSmallest Qwen3 model tested; used in conscientiousness sweep example (Table 6)
- Qwen3.5-9Brelated_toSmallest model tested as evolver; produces harness updates comparable to Claude Opus 4.6 on SkillsBench
- Qwen3-32Brelated_toWeak-tier open-source model exhibiting both harness activation failure and adherence failure, with 25.1% skill-load rate
- Qwen3-14Brelated_to14B Qwen3 model quantized to 4-bit NF4; tested in OCEAN benchmarks
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Large open-source model used as anchor agent and anchor evolver; illustrates benchmark-dependent evolver performance
- Embedding model used to embed user messages for ridge regression analysis of persona drift causes
- Base vision-language model used to instantiate ATLAS.
- One of four LLMs selected for representation analysis; embedding dimension D=4096; used as demonstration model in scatter plots.
- Qwen 35B (3B active params, score 4.38) outscores Hermes 405B (405B active params, score 1.75) by 2.5xfinding0.694Parameters don't predict scores; 135x more parameters yields 60% lower score
- Quantization applied to LLMs above 12B parameters to enable evaluation on available hardware
- Model-specific difference in how steered personas manifest
- Quantifies harness activation failure for weak-tier models vs. strong-tier models