finding
active
finding:the-representation-based-path-and-the-behavior-based-path-in-llama-3-1-8b-activation-space-trace-out-similar-curves-demonstrating-bidirectional-geometry-alignment

The representation-based path and the behavior-based path in Llama-3.1 8B activation space trace out similar curves, demonstrating bidirectional geometry alignment.

Key empirical result: optimizing steering interventions for behavioral outputs and fitting the representation geometry yield similar paths in activation space.
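One way to make "similar curves" concrete is to resample both paths to a common arc-length parameterization and compare them point by point. The sketch below is illustrative only and is not the procedure behind this finding: the function names, the arc-length resampling, and the two metrics (mean tangent cosine and length-normalized mean gap) are assumptions, and synthetic curves stand in for real Llama-3.1 8B activations.

```python
import numpy as np

def resample_by_arc_length(path, n_points=50):
    """Resample a polyline of shape (T, d) to n_points evenly spaced in arc length.

    Assumes consecutive points are distinct, so arc length strictly increases.
    """
    path = np.asarray(path, dtype=float)
    seg = np.linalg.norm(np.diff(path, axis=0), axis=1)   # segment lengths
    s = np.concatenate([[0.0], np.cumsum(seg)])           # cumulative arc length
    s /= s[-1]                                            # normalize to [0, 1]
    targets = np.linspace(0.0, 1.0, n_points)
    cols = [np.interp(targets, s, path[:, j]) for j in range(path.shape[1])]
    return np.stack(cols, axis=1)

def path_similarity(path_a, path_b, n_points=50):
    """Return (mean tangent cosine, mean pointwise gap / length of path_a)."""
    a = resample_by_arc_length(path_a, n_points)
    b = resample_by_arc_length(path_b, n_points)
    ta, tb = np.diff(a, axis=0), np.diff(b, axis=0)       # discrete tangents
    cos = np.sum(ta * tb, axis=1) / (
        np.linalg.norm(ta, axis=1) * np.linalg.norm(tb, axis=1)
    )
    length_a = np.linalg.norm(ta, axis=1).sum()
    gap = np.linalg.norm(a - b, axis=1).mean() / length_a
    return float(cos.mean()), float(gap)

def toy_arc(t, dim=8):
    """Hypothetical stand-in for a path: a quarter circle embedded in R^dim."""
    theta = 0.5 * np.pi * np.asarray(t, dtype=float)
    out = np.zeros((theta.size, dim))
    out[:, 0], out[:, 1] = np.cos(theta), np.sin(theta)
    return out

# Two samplings of the same underlying curve, the second slightly offset.
representation_path = toy_arc(np.linspace(0.0, 1.0, 40))
behavior_path = toy_arc(np.linspace(0.0, 1.0, 60) ** 2)
behavior_path[:, 2] += 0.01

mean_cos, rel_gap = path_similarity(representation_path, behavior_path)
print(f"mean tangent cosine = {mean_cos:.3f}, relative gap = {rel_gap:.3f}")
```

A mean tangent cosine near 1 together with a small relative gap indicates that the two paths follow nearly the same curve. Resampling by arc length keeps the comparison from depending on how densely each path happened to be sampled.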

Neighborhood — ranked by edge-count

Claims (2)

claim

Hypotheses (1)

hypothesis

Concepts (2)

concept
  • Behavior-based path: the path in activation space derived by optimizing steering interventions to produce outputs along the behavior manifold, independent of representation geometry.
  • Representation-based path: the path in activation space derived by fitting the representation manifold, used to steer along the geometric structure of internal representations.

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.