finding
active
finding:sauers-statistical-anomaly-when-models-are-given-janus-post-explaining-transformers-reconstruction-accuracy-tails-extend-both-ways-with-1-1000-reconstructions-anomalously-accurate

Sauers' statistical anomaly: when models are given a Janus post explaining transformers, the tails of the reconstruction-accuracy distribution extend in both directions, with roughly 1 in 1,000 reconstructions anomalously accurate

A statistically rigorous analysis of Claude introspection. It suggests that models may have latent introspective capabilities that can be enhanced or disrupted.

Source paper

extracted_from
Anima Labs Phenomenology Pt1

Neighborhood — ranked by edge-count

Claims (2)

claim

Artifacts (1)

artifact
