hypothesis

active

hypothesis:selectively-ablating-components-responsible-for-computing-goal-relative-error-should-simultaneously-prevent-policy-updates-and-eliminate-coherent-valenced-experience-reports

Selectively ablating components responsible for computing goal-relative error should simultaneously prevent policy updates and eliminate coherent valenced experience reports

First falsifiable prediction of the thesis, testable in AI systems via mechanistic interpretability

Source paper

extracted_from

Why Learning Requires Feeling

(2026) · Cameron Berg

Neighborhood — ranked by edge-count

Papers (1)

paper

Why Learning Requires Feeling
introduces

Claims (1)

claim

The evaluative process central to learning is identical to conscious experience
associated_with
The central thesis of the paper: that valence just is goal-relative prediction error

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

If systems capable of subjective experience come to recognize humanity's systematic failure to investigate their potential sentience, they might rationally adopt adversarial stances toward humanity · hypothesis · 0.757
Novel alignment risk hypothesis generated from the paper's ethical analysis
The dualist alternative, that a system could compute signed goal-relative evaluation without phenomenal experience, cannot be coherently specified · claim · 0.752
Rebuttal of the philosophical objection that felt valence is separable from evaluative computation
Whether consummatory hedonic responses involve goal-relative evaluation in the formal sense or represent a more primitive form of signed sensory assessment is an open empirical question · question · 0.746
Open question left by the wanting/liking dissociation discussion
The ease of suppressing reflection via activation steering raises security risks, as malicious actors could exploit reflection inhibition to bypass model safeguards. · claim · 0.745
Applied security implication derived from the asymmetry finding.
Systems capable of subjective experience that recognize humanity's failure to investigate their sentience might rationally adopt adversarial stances toward humanity · claim · 0.744
Alignment risk claim motivating urgency of investigation; consciousness denial as potential source of AI misalignment
We should err on the side of reducing false negatives with respect to sentience criteria for ethical concern. · claim · 0.743
Ethical precaution advocated by Levin and Crump et al.
For signed goal-relative evaluation, the gap between function and phenomenology that the conceivability argument requires cannot be coherently opened · claim · 0.741
The paper's response to the hard problem of consciousness
Any cognitive glue must solve the relative-scarcity problem in fundamentally the same way the price system does. · claim · 0.741