concept
active
concept:hydra-effectHydra Effect
Phenomenon where layer ablations trigger silent downstream compensation, contrasted with ESR
Neighborhood — ranked by edge-count
Claims (1)
claim
- Distinguishes ESR from prior work on model self-repair
Concepts (1)
concept
- Endogenous Steering ResistancecontradictsThe central phenomenon introduced by this paper: inference-time recovery from irrelevant activation steering in LLMs
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Tiny differences in initial conditions can lead to vastly divergent outcomes, explaining why living structure must be created dynamically.
- An emergent ordering created by a well-arranged structure of centers of different sizes, binding them into a whole.
- Phenomenon where a moment bleeds into adjacent moments temporally; includes deja vu as reversed direction.
- The phenomenon where naive or vulnerable users perceive dialogue agents as having human-like desires and feelings, putting them at risk of emotional manipulation
- Unintended changes in model behavior when performing edits; compared between VPD editing and fine-tuning.
- The nourishment and increased wholeness experienced by a maker when creating living structure.