method
active
method:dose-response-feature-steering-protocol

Dose-Response Feature Steering Protocol

Varies each feature's activation from -0.6 to +0.6 and averages the results over 10 random seeds per setting.
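
A minimal sketch of how such a sweep could be organized. The range (-0.6 to +0.6) and the 10 seeds per setting come from the description above. Everything else is an assumption made for illustration: the 7-point grid, the function names, and `toy_measure`, which is a hypothetical stand-in for "generate with the feature steered to this level, then score the output".

```python
import random
import statistics


def sweep_levels(lo=-0.6, hi=0.6, steps=7):
    """Evenly spaced steering levels from lo to hi, inclusive.

    The number of steps is an assumption; the source only gives the range.
    """
    return [round(lo + i * (hi - lo) / (steps - 1), 10) for i in range(steps)]


def dose_response(measure, levels, n_seeds=10):
    """Return {level: (mean, stdev)} of measure(level, seed) over n_seeds seeds."""
    curve = {}
    for level in levels:
        scores = [measure(level, seed) for seed in range(n_seeds)]
        curve[level] = (statistics.mean(scores), statistics.stdev(scores))
    return curve


def toy_measure(level, seed):
    """Hypothetical scorer: a linear response to the steering level plus seeded noise."""
    rng = random.Random(seed)
    return 2.0 * level + rng.gauss(0.0, 0.05)


if __name__ == "__main__":
    for level, (mean, sd) in dose_response(toy_measure, sweep_levels()).items():
        print(f"level={level:+.1f}  mean={mean:+.3f}  sd={sd:.3f}")
```

In a real run, `toy_measure` would be replaced by a function that seeds the sampler, generates text with the feature set to `level`, and returns a behavioral score. Reporting the spread next to the mean makes it easier to judge whether the response to the steering level is systematic or mostly seed noise.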

Neighborhood — ranked by edge-count

Frameworks (1)

framework
  • Method of adding scaled versions of sparse autoencoder latent features during generation to causally modulate model behavior
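
The steering step itself can be sketched as a vector addition. This assumes, for illustration only, that a sparse autoencoder latent feature is represented by a direction vector in the model's hidden space and that steering adds a scaled copy of that direction to a hidden activation at each generation step. The function name `steer` and the plain-list representation are hypothetical, not taken from the source.

```python
def steer(hidden, direction, scale):
    """Return hidden + scale * direction, computed elementwise.

    hidden:    activation vector at one position (list of floats)
    direction: feature direction, assumed to have the same length as hidden
    scale:     steering strength; 0.0 leaves the activation unchanged
    """
    if len(hidden) != len(direction):
        raise ValueError("hidden and direction must have the same length")
    return [h + scale * d for h, d in zip(hidden, direction)]


if __name__ == "__main__":
    hidden = [1.0, 2.0]
    direction = [1.0, 0.0]
    print(steer(hidden, direction, 0.5))   # positive scale pushes along the feature
    print(steer(hidden, direction, -0.5))  # negative scale pushes against it
```

Because the scale is set by the experimenter rather than by the input, a change in behavior that tracks the scale supports a causal reading of the feature.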

Artifacts (1)

artifact

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.