finding
active
finding:embedding-based-construct-classifiers-achieve-mean-accuracy-and-f1-macro-of-95-96-across-ocean-hexaco-dark-tetrad-cmni-cfni-constructsEmbedding-based construct classifiers achieve mean accuracy and F1-macro of 95.96% across OCEAN, HEXACO, Dark Tetrad, CMNI, CFNI constructs
Validates use of lightweight classifiers as replacement for frontier LLM evaluation during alpha sweeps
Source paper
extracted_from(2026) · Leonardo Blas · Robin Jia · Emilio Ferrara
Neighborhood — ranked by edge-count
Methods (1)
method
- Logistic regressor on Qwen3Embedding-0.6B embeddings trained on construct statements; used to measure construct presence in alpha sweeps
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Psychopathy construct classifier achieves 90.50% accuracy, lowest among all evaluated constructsfinding0.759Lowest individual classifier performance
- Highest individual classifier performance among OCEAN constructs
- Core result of Experiment 3: cross-model semantic convergence under self-referential processing
- Table 2, row 3, showing equivalence when prior preferences match rewards.
- Within-family factual generalization (F0-F2) is consistently strong across all models and prompt settings.finding0.745Establishes a reliable baseline for factual truth direction universality within simple factual recall.
- Validates theoretical PMI convergence claim on real data
- Validates the statement synthesis pipeline as producing behavior-specific content comparable to established methods
- Quantifies performance cost of fine-tuning and steering; deployment steering has minimal accuracy cost.