method
active
method:crowdworker-model-comparison-testsCrowdworker model comparison tests
Procedure where crowdworkers compare responses from two models and indicate preference, used to compute Elo scores.
Neighborhood — ranked by edge-count
Methods (1)
method
- Elo scoreimplementsA rating system used to compare model helpfulness and harmlessness based on crowdworker preferences.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The more general, daily-use version of the mirror-of-self test: asking which of A or B induces greater feeling of wholeness in the observer
- Probe construction method: concept vector at each layer is L2-normalized difference between mean positive and mean negative representations from contrastive system prompts
- A practical test to determine if center B helps center A by comparing the life of A with and without B.
- Novel task asking which of two sentences received a stronger injection, using matched-pairs design to control for positional bias
- The iterative method Alexander uses to make design decisions: compare two versions and ask which is more a picture of one's own eternal self, repeating until convergence.
- Method of testing truss appearance from below by building a full-scale cardboard mockup to check visual correctness.