finding
active
EFE decrease after sticker removal is statistically significant (Wilcoxon p = 6.33×10⁻⁹) across 80 evaluations
Confirms that EFE systematically decreases after sticker removal, validating the self-prior as an internal criterion
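As a rough illustration of the kind of paired test this finding reports, the sketch below runs a Wilcoxon signed-rank test on 80 before/after EFE pairs. Everything in it is an assumption made for illustration: the data are synthetic (not the paper's measurements), the function name `wilcoxon_signed_rank` is hypothetical, and the test is two-sided with a normal approximation and no continuity correction. The record does not say whether the authors used an exact or approximate, one-sided or two-sided variant.

```python
import math
import random


def wilcoxon_signed_rank(before, after):
    """Two-sided Wilcoxon signed-rank test (normal approximation).

    Returns (w_plus, w_minus, p_value). Zero differences are dropped and
    tied absolute differences receive average ranks.
    """
    diffs = [a - b for b, a in zip(before, after) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank the absolute differences, averaging ranks within tie groups.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        t = j - i + 1
        tie_term += t**3 - t
        i = j + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)

    # Null distribution of the rank sum, with the standard tie correction.
    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48
    z = (min(w_plus, w_minus) - mean) / math.sqrt(var)
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail probability
    return w_plus, w_minus, p_value


# Synthetic stand-in data: EFE with the sticker present, then after removal.
rng = random.Random(0)
efe_before = [rng.gauss(5.0, 1.0) for _ in range(80)]
efe_after = [b - abs(rng.gauss(1.0, 0.5)) for b in efe_before]

w_plus, w_minus, p = wilcoxon_signed_rank(efe_before, efe_after)
print(f"W+ = {w_plus}, W- = {w_minus}, p = {p:.3g}")
```

A paired, rank-based test fits this setting because each evaluation supplies its own before/after comparison and no normality assumption on the EFE differences is needed.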
Source paper
extracted_from · (2026) · Dongmin Kim · Hoshinori Kanazawa · Yasuo Kuniyoshi
Neighborhood — ranked by edge-count
Claims (2)
claim
- The self-prior operates as an internal criterion for distinguishing self from non-self, without external reward or explicitly computed sticker location · associated_with · supports · Central interpretive claim of the paper, supported by EFE decrease after sticker removal
- Core mechanism claim linking mismatch detection to behavior through EFE minimization
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Baseline EFE when sticker is present, used for comparison
- Qualitative confirmation of EFE drop in trained model vs. untrained model (Δ = +1.70)
- Control showing that the EFE signal is learned, not inherent to the architecture
- Suggests the agent learned to recognize and approach the sticker before achieving reliable removal
- Shows learning progression from chance-level to functional behavior
- Agent achieves approximately 70% sticker-removal success rate by end of 500k training steps · finding · 0.740 · Main behavioral result demonstrating the model's efficacy in the mirror-mark task
- Connects the model's behavior to Zaadnoordijk and Bayne's taxonomy of intentional agency
- Random latent ablation produces slight increase in ESR rate (3.8% to 4.2%), not statistically significant · finding · 0.733 · Control result confirming OTD ablation effect is specific to those latents, not a general ablation artifact
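The "related by similarity" filter described for this list (cosine ≥ 0.65, no typed edge) can be sketched as follows. This is a minimal illustration, not the system's actual implementation: the function names, the toy two-dimensional embeddings, and the representation of typed edges as a set of name pairs are all hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def untyped_neighbors(target, embeddings, typed_edges, threshold=0.65):
    """Entities similar to `target` that have no typed edge to it."""
    results = []
    for name, vec in embeddings.items():
        if name == target:
            continue
        # Skip entities already linked to the target in either direction.
        if (target, name) in typed_edges or (name, target) in typed_edges:
            continue
        score = cosine_similarity(embeddings[target], vec)
        if score >= threshold:
            results.append((name, score))
    return sorted(results, key=lambda pair: pair[1], reverse=True)


# Toy data (hypothetical): "c" is similar but already has a typed edge.
embeddings = {
    "target": [1.0, 0.0],
    "a": [0.9, 0.1],
    "b": [0.0, 1.0],
    "c": [1.0, 1.0],
    "d": [1.0, 1.0],
}
typed_edges = {("target", "c")}

neighbors = untyped_neighbors("target", embeddings, typed_edges)
for name, score in neighbors:
    print(f"{name}: {score:.3f}")
```

Entities returned this way are only candidates: a high score may indicate a missing edge or a duplicate, which still has to be checked by hand.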