Deontological optimization

Predictive accuracy applies pressure directly on actions rather than consequences, avoiding instrumental convergence.

Neighborhood — ranked by edge-count

artifact

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Deontological Ethicsconcept0.840
Ethical theories often holding that total resource transfer to super-beneficiaries would be supererogatory or impermissible
Direct Preference Optimizationframework0.754
Post-training alignment method during which undesirable behaviors emerged in the studied model.
Multi-objective optimizationconcept0.750
Framework for optimizing multiple objectives simultaneously, used in MTL.
Deep Optimizationframework0.746
optimization pressureconcept0.745
The force of gradient-based learning on structured data that drives networks to organize their representations into geometric structures.
Pareto optimalityconcept0.723
Trade-off concept where no metric can be improved without worsening another.
Proximal Policy Optimizationmethod0.722
RL algorithm used for training models to comply with the conflicting objective
Deliberative Alignmentframework0.721
OpenAI's approach integrating chain-of-thought reasoning into alignment; parallels contemplative self-monitoring