community
active
leiden_hybrid_papers
label: sonnet
community:leiden_hybrid_papers-run3-c1

LLM interpretability & self-awareness

Methods for probing, explaining, and evaluating internal representations and reflective behaviors in large language models.

12 members. Each node is clickable.

Loading graph…