finding
active
finding:in-llama-2-13b-cities-and-neg-cities-show-approximately-orthogonal-axes-of-separation-in-pca-visualizations-at-intermediate-layers

In LLaMA-2-13B, cities and neg_cities show approximately orthogonal axes of separation in PCA visualizations at intermediate layers

A case of misalignment showing that, in smaller models, a dataset and its negated counterpart do not always share a truth direction
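One rough way to put a number on "approximately orthogonal" is to compare the axes along which true and false statements separate in each dataset. The sketch below is illustrative only. It uses a difference-of-means axis instead of the PCA visualization the finding is based on, and it runs on synthetic activations, not on LLaMA-2-13B. The names `separation_axis`, `axis_cosine`, and `synthetic_dataset` are hypothetical helpers introduced here.

```python
import numpy as np


def separation_axis(acts, labels):
    """Unit vector pointing from the mean false activation to the mean true one.

    Assumption: a difference-of-means axis stands in for the separation
    axis seen in a PCA plot; the two need not coincide exactly.
    """
    acts = np.asarray(acts, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    diff = acts[labels].mean(axis=0) - acts[~labels].mean(axis=0)
    return diff / np.linalg.norm(diff)


def axis_cosine(acts_a, labels_a, acts_b, labels_b):
    """Cosine similarity between the separation axes of two datasets.

    Values near 0 indicate roughly orthogonal axes; values near +1 or -1
    indicate a shared (or sign-flipped) axis.
    """
    axis_a = separation_axis(acts_a, labels_a)
    axis_b = separation_axis(acts_b, labels_b)
    return float(axis_a @ axis_b)


def synthetic_dataset(axis, n=200, noise=0.1, seed=0):
    """Toy activations: true/false points displaced along +axis / -axis.

    Purely synthetic stand-in for layer activations on a labeled dataset.
    """
    rng = np.random.default_rng(seed)
    axis = np.asarray(axis, dtype=float)
    labels = rng.integers(0, 2, size=n).astype(bool)
    labels[0], labels[1] = True, False  # guarantee both classes are present
    signs = np.where(labels, 1.0, -1.0)
    acts = signs[:, None] * axis[None, :]
    acts = acts + noise * rng.standard_normal((n, axis.size))
    return acts, labels


if __name__ == "__main__":
    dim = 16
    e0, e1 = np.eye(dim)[0], np.eye(dim)[1]
    # Hypothetical stand-ins for "cities" and "neg_cities" activations.
    cities = synthetic_dataset(e0, seed=1)
    neg_cities = synthetic_dataset(e1, seed=2)
    print("cosine between separation axes:", axis_cosine(*cities, *neg_cities))
```

With real data, `acts` would be residual-stream activations from one intermediate layer for each statement, and `labels` the true/false annotations. The cosine is only a summary statistic and does not replace inspecting the projections themselves.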
