finding
active
finding:llm-biases-mirror-human-biases-in-morally-significant-waysLLM biases mirror human biases in morally significant ways
Finding from Navigli et al. cited to justify applying human contemplative strategies to AI systems
Source paper
extracted_from(2025) · Ruben Laukkonen · Fionn Inglis · Shamil Chandaria · Lars Sandved-Smith +4
Neighborhood — ranked by edge-count
Claims (1)
claim
- Foundational analogy motivating the entire Contemplative AI approach
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core cross-modal empirical result: larger and better language models align better with vision models
- Establishes that the observed linear structure is not merely a representation of text probability
- Binder et al. finding cited as evidence that LLMs possess introspective capacity analogous to mindfulness
- Skeptical prior work motivating the need to validate self-reports against internal states rather than taking them at face value
- Recommendation for companies on LM outputs.
- Core hypothesis linking internal uncertainty to self-reflection behavior, tested via probing experiments
- We hypothesize that LLMs represent correctness of arithmetic expressions differently from factual statements.hypothesis0.776Core working hypothesis motivating the factual vs. arithmetic task split in the experimental design.
- Interpretive claim connecting scale to abstraction level in LLM representations