question
pending-review
question:can-natural-language-explanations-of-activations-generated-through-unsupervised-reconstruction-genuinely-capture-model-cognition
Can natural language explanations of activations generated through unsupervised reconstruction genuinely capture model cognition?
natural.md
Frontmatter (10 fields)
{
"doc": "natural.md",
"context": "Core research question motivating NLA development and validation through case studies and causal interventions.",
"category": "ai",
"norm_label": "Can natural language explanations of activations generated through unsupervised reconstruction genuinely capture model cognition?",
"graphify_id": "question_nla_interpretability",
"source_file": "natural.md",
"imported_from": "/Users/antonborzov/Documents/Research.nosync/papers/extract_typed_out/natural/graph.json",
"extracted_type": "question",
"source_location": "§Introduction",
"graphify_file_type": "question"
}
Outgoing (0)
None.
Incoming (1)
gates (1)
- Unverbalized Evaluation Awareness (concept)
Mentions (1)
- papers-typed
natural.md