finding
active
finding:meta-prompt-esr-enhancement-effects-scale-with-model-size-across-llama-and-gemma-familiesMeta-prompt ESR enhancement effects scale with model size across Llama and Gemma families
Suggests underlying self-monitoring circuits must be present for meta-prompting to enhance them
Source paper
extracted_from(2026) · Alex McKenzie · Keenan Pepper · Stijn Servaes · Martin Leitgab +5
Neighborhood — ranked by edge-count
Claims (1)
claim
- Mechanistic interpretation of why meta-prompting effects scale with model size
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates ESR can be deliberately enhanced through prompting in the largest model
- Appending instructional meta-prompts to object-level prompts to deliberately enhance ESR in models
- Cross-judge validation of the primary ESR finding across OpenAI, Alibaba, Anthropic, and Google judge models
- Characterizes the narrow operating window in which ESR can manifest
- Shows the instruction effect, while shifting geometry, may not produce consistent generalization effects across model families.
- We cannot isolate whether ESR reflects scale, architecture, or training procedures in Llama-3.3-70Bclaim0.766Epistemic limitation claim acknowledging confounds in the cross-model comparison
- Establishes generalizability of the core difficulty-boundary finding across model families.
- Establishes potential Llama-family specificity or scale specificity of ESR phenomenon