claim
active
claim:probe-based-ranking-outperforms-gradient-based-and-llm-judge-methods-at-lower-cost

Probe-based ranking outperforms gradient-based and LLM-judge methods at lower cost

Authors' claim that their approach is both more effective in reduction and cheaper than prior methods.

Source paper

extracted_from
Probe-Based Data Attribution: Surfacing and Mitigating Undesirable Behaviors in LLM Post-Training
(2026) · Frank Xiao · Santiago Aranguri

Neighborhood — ranked by edge-count

Findings (1)

finding

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.