concept
active
concept:h-b-activations-yes-no-binary-prefill

h_b Activations (Yes/No Binary Prefill)

Residual-stream activations extracted by prefilling a Yes/No response to an identity statement; achieves perfect probe separability

Neighborhood — ranked by edge-count

Methods (3)

method
  • Probe-based injection using L1-regularized logistic regressor with learned intercept on h_b activations
  • Probe-based injection using L2-regularized logistic regressor with learned intercept on h_b activations
  • Mean-difference vectors derived from Yes/No binary-prefill activations (h_b)
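
As a rough illustration of the third method, a mean-difference vector can be sketched as the per-dimension difference between the average "Yes"-prefill activation and the average "No"-prefill activation. The snippet below is a minimal sketch under stated assumptions, not the source's implementation: the activations are synthetic stand-ins generated with the standard library, and the names `mean_difference_vector`, `project`, `yes_acts`, `no_acts`, and `DIM` are hypothetical. The extraction layer, token position, and hidden size are not stated on this page.

```python
import random


def mean_difference_vector(yes_acts, no_acts):
    """Return mean(yes_acts) - mean(no_acts), computed per dimension."""
    dim = len(yes_acts[0])
    mean_yes = [sum(a[i] for a in yes_acts) / len(yes_acts) for i in range(dim)]
    mean_no = [sum(a[i] for a in no_acts) / len(no_acts) for i in range(dim)]
    return [y - n for y, n in zip(mean_yes, mean_no)]


def project(act, direction):
    """Dot product of one activation vector with a direction."""
    return sum(a * d for a, d in zip(act, direction))


# Synthetic stand-ins for h_b activations (ASSUMPTION: real activations
# would come from the model's residual stream, which is not shown here).
rng = random.Random(0)
DIM = 8  # hypothetical toy dimension, far smaller than a real hidden size
yes_acts = [[rng.gauss(1.0, 0.1) for _ in range(DIM)] for _ in range(20)]
no_acts = [[rng.gauss(-1.0, 0.1) for _ in range(DIM)] for _ in range(20)]

direction = mean_difference_vector(yes_acts, no_acts)

# The two classes are linearly separable along `direction` when every
# "Yes" projection exceeds every "No" projection.
yes_proj = [project(a, direction) for a in yes_acts]
no_proj = [project(a, direction) for a in no_acts]
separable = min(yes_proj) > max(no_proj)
```

On this toy data the two clusters are far apart by construction, so the separability check only demonstrates the mechanics; it says nothing about how real h_b activations behave.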

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.