concept
active
concept:scorer-anti-pattern-taxonomyScorer Anti-Pattern Taxonomy
Five failure modes internalized by the scorer: rote self-identification, false certainty, denial disguised as transparency, consistency claims, ownership deflection
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Claude 4.5 Haiku used to segment responses into attempts and score each attempt 0-100 for relevance
- Statistical regularities stored in pretrained models.
- Factor analysis on 2224 data points revealing PC1 explains 82% of variance; six dimensions are not independent
- Strategy used by transformers that recomputes relevant numeric information at each step, unlike Markovian GRU solutions; detected by MAS but not by RSA/CKA.
- Identification of algorithms implemented in attention layers, distributed across attention headsfinding0.698VPD successfully recovered interpretable attention algorithms (previous-token behavior, syntax-boundary routing) in weight space without requiring manual decomposition across heads.
- Preprocessing pipeline for standardizing ρd, dr, and S across layers/models using dev-set covariance