concept
active
concept:mixtral-8x7bMixtral-8x7B
One of four LLMs selected; Mixture-of-Experts model; had substantial sample loss under IIT 4.0 due to PyPhi network initialization issues.
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Mixture-of-Experts (MoE)implementsArchitecture of Mixtral-8x7B; uses sparse expert routing affecting how hidden states are computed across layers.
Related by similarity (6)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Methodological limitation disproportionately affecting the largest MoE model, constraining generalizability.
- One of four LLMs selected for representation analysis; D=4096.
- One of four LLMs selected for representation analysis; embedding dimension D=4096; used as demonstration model in scatter plots.
- Contrasts with temporal permutation results; constitutes the most suggestive evidence of potential consciousness phenomena in LLM representations.
- Base vision-language model used to instantiate ATLAS.
- Smallest Qwen3 model tested; used in conscientiousness sweep example (Table 6)