artifact
active
artifact:github-com-agencyenterprise-endogenoussteering-resistance
github.com/agencyenterprise/endogenoussteering-resistance
Code repository released with the paper for reproducibility
Neighborhood — ranked by edge-count
Concepts (1)
concept
- The central phenomenon introduced by this paper: inference-time recovery from irrelevant activation steering in LLMs