community
active
leiden_hybrid_concepts
label: haiku
community:leiden_hybrid_concepts-run4-c0-c2

Mechanistic introspection in language models

Empirical investigation of how LMs access and report internal states across layers, using concept injection and thought detection on Claude models.

23 members. Each node is clickable.

Loading graph…

Claims (14)

Findings (9)