method
active
method:linear-probe-for-evaluation-awareness

Linear Probe for Evaluation Awareness

Nguyen et al. trained linear probes on activations to distinguish evaluation from deployment scenarios.

Neighborhood — ranked by edge-count

Methods (1)

method
  • Linear Probe
    related_to
    Simple linear classifiers trained on model activations used as the probing technique within the introduced method.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.