concept
active
concept:the-hydra-effect-emergent-self-repair-in-language-model-computations-mcgrath-et-al-2023

The Hydra Effect: Emergent Self-Repair in Language Model Computations (McGrath et al., 2023)

Related work on model self-repair, contrasted with ESR which involves explicit active correction

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.