question
active
question:why-did-mass-mean-probing-with-cities-neg-cities-training-data-perform-poorly-for-the-70b-model-despite-larger-than-smaller-than-performing-well

Why did mass-mean probing with cities+neg_cities training data perform poorly for the 70B model, despite larger_than+smaller_than performing well?

Open question about scale-dependent asymmetry in training data effects

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.