quote

active

quote:i-don-t-follow-ai-alignment-research-in-any-depth-but-i-am-noticing-a-striking-disconnect-between-the-concepts-appearing-in-those-discussions-and-recent-advances-in-ai-especially-gpt-3

I don't follow AI alignment research in any depth, but I am noticing a striking disconnect between the concepts appearing in those discussions and recent advances in AI, especially GPT-3.

Quote from a question that sparked the post, highlighting the gap between theory and practice.

Source paper

extracted_from

Simulators — LessWrong

Neighborhood — ranked by edge-count

Frameworks (1)

framework

Agentic AI ontology
associated_with
The traditional alignment framework focusing on agents optimized to pursue goals.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Humanity has never solved the alignment problem between generations of humans, so AI alignment debates are not novel — they reflect a perennial unresolved problem.claim0.812
Deflates the novelty of AI alignment by pointing to its structural identity with intergenerational value transmission
How Can We Ensure Alignment Between Artificial Intelligencequestion0.808
AI alignmentconcept0.779
Field within which this work has implications for evaluating alignment progress.
Many recent discussions of AI, and its impact on individuals and on society, are importantly incomplete because they neglect Diverse Intelligence, synthetic morphology, and developmental biology.claim0.773
Central thesis of the paper — the framing premise from which all other arguments follow
AI Alignment and Safetyconcept0.771
The broader domain for which ESR has dual implications: resistance to adversarial manipulation vs. interference with safety interventions
Recent AI advances are revolutionary but far from recapitulating human-like cognitive abilities or consciousness.claim0.769
A General Language Assistant as a Laboratory for Alignment (Askell et al. 2021)concept0.768
HHH training framework that Claude was trained with prior to experiments
The existential concerns raised about AI — alignment, control, value drift, supplanting — are not new and are precisely the concerns humanity has always faced in having children.claim0.767
Key rhetorical and philosophical argument establishing continuity between AI concerns and child-rearing