artifact
active
artifact:github-com-agencyenterprise-endogenoussteering-resistance

github.com/agencyenterprise/endogenoussteering-resistance

Code repository released alongside the paper to support reproducibility

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • The central phenomenon introduced by this paper: inference-time recovery from irrelevant activation steering in LLMs