method
active
method:attention-head-localization-analysis

attention head localization analysis

Analysis measuring whether each attention head's maximum attention increase points to the correct injected sentence

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Mechanism by which attention heads detect injected perturbations and route information about them to the final token position

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.