concept
active
concept:qwen3-235b-a22bQwen3-235B-A22B
Large open-source model used as anchor agent and anchor evolver; illustrates benchmark-dependent evolver performance
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Qwen3-32Brelated_toWeak-tier open-source model exhibiting both harness activation failure and adherence failure, with 25.1% skill-load rate
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- 14B Qwen3 model quantized to 4-bit NF4; tested in OCEAN benchmarks
- Smallest Qwen3 model tested; used in conscientiousness sweep example (Table 6)
- Smallest model tested as evolver; produces harness updates comparable to Claude Opus 4.6 on SkillsBench
- 4B Qwen3 model tested in OCEAN benchmarks
- Base vision-language model used to instantiate ATLAS.
- Embedding model used to embed user messages for ridge regression analysis of persona drift causes
- Qwen 35B (3B active params, score 4.38) outscores Hermes 405B (405B active params, score 1.75) by 2.5xfinding0.728Parameters don't predict scores; 135x more parameters yields 60% lower score
- Demonstrates that harness loading is necessary but not sufficient for harness benefit; cleanest separation of activation and adherence