method
active
method:l2li-injection

L2LI Injection

Probe-based injection using L2-regularized logistic regressor with learned intercept on h_b activations
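A minimal sketch of what this description could look like in practice, assuming the probe is a binary logistic regressor fit on h_b activation vectors and that injection means adding a scaled copy of the probe's weight direction to an activation. The function names `train_l2_probe` and `inject`, the penalty strength `lam`, the scale `alpha`, the unit normalization, and the synthetic data are all hypothetical choices for illustration; none of them come from the entry itself.

```python
import math
import random


def train_l2_probe(X, y, lam=0.01, lr=0.5, epochs=300):
    """Fit weights w and a learned intercept b by gradient descent on
    mean log-loss + lam * ||w||_2^2. The intercept is not penalized."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    b = 0.0
    for _ in range(epochs):
        grad_w = [2.0 * lam * wj for wj in w]  # gradient of the L2 penalty
        grad_b = 0.0
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi)) + b
            p = 1.0 / (1.0 + math.exp(-z))     # predicted P(label = 1)
            err = (p - yi) / n
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def inject(h, w, alpha=4.0):
    """Assumed injection rule: add alpha times the unit-normalized
    probe direction to the activation vector h."""
    norm = math.sqrt(sum(wj * wj for wj in w))
    return [hj + alpha * wj / norm for hj, wj in zip(h, w)]


# Synthetic stand-in for h_b activations: the two classes differ along dim 0.
random.seed(0)


def sample(label, d=4):
    shift = 1.0 if label == 1 else -1.0
    return [shift + random.gauss(0, 0.5)] + [random.gauss(0, 0.5) for _ in range(d - 1)]


labels = [i % 2 for i in range(200)]
X = [sample(label) for label in labels]
w, b = train_l2_probe(X, labels)

h = [0.0, 0.0, 0.0, 0.0]
h_injected = inject(h, w)
```

Under these assumptions, the zero-intercept variant (L2ZI) would correspond to holding `b` at 0 during fitting, and the L1 variants would swap the squared L2 penalty for an L1 penalty on `w`.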

Neighborhood — ranked by edge-count

Concepts (2)

concept

Methods (2)

method
  • L1LI Injection
    related_to
    Probe-based injection using L1-regularized logistic regressor with learned intercept on h_b activations
  • L2ZI Injection
    related_to
    Probe-based injection using L2-regularized logistic regressor with zero intercept on h_b activations
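Read literally, the names of this method and its two typed neighbors differ only in the penalty norm (L1 or L2) and in whether the intercept is fit (LI) or fixed at zero (ZI). The objectives below sketch that reading. They assume labels $y_i \in \{0, 1\}$, binary cross-entropy loss $\ell$, logistic sigmoid $\sigma$, and a regularization strength $\lambda$; these symbols and the exact form of the penalty are assumptions, not taken from the entries.

```latex
\[
\begin{aligned}
\text{L2LI:}\quad & \min_{w,\,b}\ \frac{1}{n}\sum_{i=1}^{n} \ell\bigl(y_i,\ \sigma(w^\top h_{b,i} + b)\bigr) + \lambda \lVert w \rVert_2^2 \\
\text{L1LI:}\quad & \min_{w,\,b}\ \frac{1}{n}\sum_{i=1}^{n} \ell\bigl(y_i,\ \sigma(w^\top h_{b,i} + b)\bigr) + \lambda \lVert w \rVert_1 \\
\text{L2ZI:}\quad & \min_{w}\ \frac{1}{n}\sum_{i=1}^{n} \ell\bigl(y_i,\ \sigma(w^\top h_{b,i})\bigr) + \lambda \lVert w \rVert_2^2
\end{aligned}
\]
```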

Related by similarity (7)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • L1ZI Injection · method · 0.793
    Probe-based injection using L1-regularized logistic regressor with zero intercept on h_b activations
  • Injection Stride · method · 0.689
    Parameter controlling how often an injection is applied during completion; s=1 injects on every activation and achieves the strongest steering
  • MDS Injection · method · 0.673
    Mean-difference vectors derived from self-statement activations (h_s); best-performing injection method in open-ended generation
  • IMTL-L · framework · 0.672
    Prior loss-balancing method using learnable loss transformation; logarithm approach recovers this
  • MDB Injection · method · 0.665
    Mean-difference vectors derived from Yes/No binary-prefill activations (h_b)
  • Concept Injection · concept · 0.659
    Technique of injecting activation patterns associated with specific concepts into a model's internal states to test whether self-reports reflect ground truth.
  • Quantitative measure of morphogenetic success: Euclidean distance between evolving embryo phenotype and target smiling-face pattern.
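The Injection Stride entry in the list above can be illustrated with a short sketch. It assumes that a stride of s means injecting at every s-th step, starting from step 0, and that injection adds a scaled direction vector to the activation; only the s=1 case (inject on every activation) is stated in the entry, so the behavior for s > 1 and the names `injection_steps` and `run_with_stride` are hypothetical.

```python
def injection_steps(num_steps, stride):
    """Return the step indices at which an injection would be applied,
    assuming injection happens at every `stride`-th step from step 0."""
    if stride < 1:
        raise ValueError("stride must be >= 1")
    return [t for t in range(num_steps) if t % stride == 0]


def run_with_stride(activations, direction, alpha, stride):
    """Add alpha * direction to the activations at the selected steps
    and leave the other steps unchanged."""
    if stride < 1:
        raise ValueError("stride must be >= 1")
    steered = []
    for t, h in enumerate(activations):
        if t % stride == 0:
            h = [hj + alpha * dj for hj, dj in zip(h, direction)]
        steered.append(list(h))
    return steered


every_step = injection_steps(6, 1)   # s=1: inject on every activation
every_third = injection_steps(6, 3)  # sparser injection under the assumed rule
steered = run_with_stride([[0.0, 0.0]] * 4, [1.0, 0.0], 2.0, 2)
```

Under this reading, larger strides touch fewer steps, which is consistent with the entry's note that s=1 gives the strongest steering.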