institute
active
institute:eleos-aiEleos AI
Research organization focused on AI welfare; employing several authors.
Neighborhood — ranked by edge-count
Thinkers (3)
thinker
- Kyle Fishaffiliated_with
- Robert Longaffiliated_with
- Kathleen Finlinsonaffiliated_with
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Discord mentioned where Gwern spoke.
- Future AI that may be rational, autonomous, and possibly conscious but lack affective consciousness.
- Developer of Mistral models, mentioned as 'horrible' but large enough for threshold effects.
- The project of ensuring AI systems do not harm humans (and other animals); sometimes in tension with AI welfare.
- Alignment approach by Anthropic that explicitly trains self-observation; predicts highest baseline and lowest prompt lift.
- Higher-level systems built on top of LLMs that produce and consume representations beyond next-token prediction; proposed as potential candidates for consciousness.
- Field within which this work has implications for evaluating alignment progress.