concept
active
concept:the-llama-3-herd-of-models-grattafiori-et-al-2024The Llama 3 Herd of Models (Grattafiori et al., 2024)
Paper describing Llama 3 model family used in this study
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Family of autoregressive transformer models used in all experiments; 7B, 13B, and 70B parameter sizes
- Language model family used in cross-modal alignment experiments across multiple sizes
- Model-specific difference in persona susceptibility
- Llama-3.3-70B exhibits internal consistency-checking mechanisms that operate during inferenceclaim0.752Central interpretive claim of the paper supported by causal ablation and activation evidence
- Meta's open large language model cited as an example of the class of models under discussion
- One of four LLMs selected for representation analysis; embedding dimension D=4096; used as demonstration model in scatter plots.
- One of four LLMs selected; larger model with D=8192 embedding dimension; analyzed across proportionally aligned layers.
- Scaling Laws for Activation Steering with Llama 2 Models and Refusal Mechanisms (Ali et al., 2025)concept0.741Related work finding larger models more resistant to steering, potentially consistent with ESR in 70B