concept
pending-review
concept:post-training-alignmentPost-training alignment
xiao-aranguri-probe-data-attribution-2026.mdFrontmatter (11 fields)
{
"author": null,
"context": "Broader research area: methods to align model behavior after initial training, where undesired behaviors can emerge.",
"category": "ai",
"enrichment": {
"is_stale": true
},
"norm_label": "Post-training alignment",
"source_url": null,
"graphify_id": "post_training_alignment",
"source_file": "xiao-aranguri-probe-data-attribution-2026.md",
"imported_from": "/Users/antonborzov/Documents/Research.nosync/papers/extract_typed_out/xiao-aranguri-probe-data-attribution-2026/graph.json",
"extracted_type": "concept",
"graphify_file_type": "concept"
}Outgoing (0)
None.
Incoming (1)
about (1)
- Probe-Based Data Attribution(method)
Mentions (1)
- papers-typed
xiao-aranguri-probe-data-attribution-2026.md