LaMDA: Language Models for Dialog Applications

By Romal Thoppilan · Daniel De Freitas · Jamie Hall · Noam Shazeer · Apoorv Kulshreshtha · Heng-Tze Cheng +4 more

DOI 10.48550/arxiv.2201.08239 arXiv 2201.08239 OpenAlex W4226399820

Original abstract

We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of Transformer-based neural language models specialized for dialog, which have up to 137B parameters and are pre-trained on 1.56T words of public dialog data and web text. While model scaling alone can improve quality, it shows less improvements on safety and factual grounding. We demonstrate that fine-tuning with annotated data and enabling the model to consult external knowledge sources can lead to significant improvements towards the two key challenges of safety and factual grounding. The first challenge, safety, involves ensuring that the model's responses are consistent with a set of human values, such as preventing harmful suggestions and unfair bias. We quantify safety using a metric based on an illustrative set of human values, and we find that filtering candidate responses using a LaMDA classifier fine-tuned with a small amount of crowdworker-annotated data offers a promising approach to improving model safety. The second challenge, factual grounding, involves enabling the model to consult external knowledge sources, such as an information retrieval system, a language translator, and a calculator. We quantify factuality using a groundedness metric, and we find that our approach enables the model to generate responses grounded in known sources, rather than responses that merely sound plausible. Finally, we explore the use of LaMDA in the domains of education and content recommendations, and analyze their helpfulness and role consistency.

Related work: refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Rows surfaced by multiple sources are weighted higher.

SDialog: A Python Toolkit for End-to-End Agent Building, User Simulation, Dialog Generation, and Evaluation
Sergio Burdisso, Séverin Baroudi, Yanis Labrak, David Grunert, Pawel Cyrta, Yiyang Chen, Srikanth Madikeri, Thomas Schaaf, Esaú Villatoro-Tello, Ahmed Hassoon, Ricard Marxer, Petr Motlicek
2026
≈ 73%
A Survey of Large Language Models
Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie and Ji-Rong Wen
2026
≈ 72%
A Pattern Language for Resilient Visual Agents
Habtom Kahsay Gidey, Alexander Lenz, Alois Knoll
2026
≈ 72%
SDialog: A Python Toolkit for End-to-End Agent Building, User Simulation, Dialog Generation, and Evaluation
Sergio Burdisso, Séverin Baroudi, Yanis Labrak, David Grunert, Pawel Cyrta, Yiyang Chen, Srikanth Madikeri, Esaú Villatoro-Tello, Thomas Schaaf, Ricard Marxer, Petr Motlicek
2025
≈ 71%
RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents
Rafael Rodriguez-Sanchez, Benjamin A. Spiegel, Jennifer Wang, Roma Patel, Stefanie Tellex and George Konidaris
2023
≈ 71%
LLMs-Healthcare : Current Applications and Challenges of Large Language Models in various Medical Specialties
Ummara Mumtaz, Awais Ahmed, Summaya Mumtaz
2026
≈ 71%
Across the Levels of Analysis: Explaining Predictive Processing in Humans Requires More Than Machine-Estimated Probabilities
Sathvik Nair and Colin Phillips
2026
≈ 71%
astra-langchain4j: Experiences Combining LLMs and Agent Programming
Rem Collier, Katharine Beaumont, Andrei Ciortea
2026
≈ 71%
Vision-Language-Action (VLA) Models: Concepts, Progress, Applications and Challenges
Ranjan Sapkota, Yang Cao, Konstantinos I. Roumeliotis, Manoj Karkee
2026
≈ 71%
A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Zihao Lin, Samyadeep Basu, Mohammad Beigi, Varun Manjunatha, Ryan A. Rossi, Zichao Wang, Yufan Zhou, Sriram Balasubramanian, Arman Zarei, Keivan Rezaei, Ying Shen, Barry Menglong Yao, Zhiyang Xu, Qin Liu, Yuxiang Zhang, Yan Sun, Shilong Liu, Li Shen, Hongxuan Li, Soheil Feizi, Lifu Huang
2025
≈ 71%
Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics
Asifullah Khan, Muhammad Zaeem Khan, Aleesha Zainab, Saleha Jamshed, Sadia Ahmad, Kaynat Khatib, Faria Bibi, and Abdul Rehman
2026
≈ 71%
From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications
Feibo Jiang, Cunhua Pan, Li Dong, Kezhi Wang, Octavia A. Dobre, and Merouane Debbah
2025
≈ 70%
Evaluating Language Model Agency through Negotiations
Tim R. Davidson, Veniamin Veselovsky, Martin Josifoski, Maxime Peyrard, Antoine Bosselut, Michal Kosinski, Robert West
2026
≈ 70%
Do Multilingual LLMs Think In English?
Lisa Schut, Yarin Gal and Sebastian Farquhar
2025
≈ 70%
Learning to Model the World with Language
Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan
2024
≈ 70%
Interpreting Language Model Parameters
in corpus
2026
≈ 69%
Paper Summary: Interpreting Language Model Parameters
in corpus
≈ 67%
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
in corpus
≈ 66%
Elephant 2000: A Programming Language Based on Speech Acts
in corpus
≈ 66%
Denotational design with type class morphisms (extended version)
in corpus
2015
≈ 66%
Linda in context
in corpus
1989
≈ 65%
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
in corpus
2026
≈ 65%
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
in corpus
2026
≈ 65%
Anima Labs Phenomenology Pt1
in corpus
≈ 65%
Denotational Design: from meanings to programs
in corpus
2015
≈ 64%
Genuinely Functional User Interfaces
in corpus
≈ 64%
ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both
in corpus
≈ 64%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 64%
Opening the Hood of a Word Processor
in corpus
1984
≈ 64%

Cited by (4)

Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
No current AI system is a strong candidate for phenomenal consciousness, yet there are no obvious technical barriers to building one — this is the central finding of Butlin et al. (2023), a systematic
CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence
Constitutional AI (CAI) demonstrates that a harmless, non-evasive AI assistant can be trained using zero human feedback labels for harmlessness, replacing them entirely with AI-generated feedback guid
Multimodal Chain-of-Thought Reasoning in Language Models
Incorporating visual features into chain-of-thought rationale generation—rather than answer generation alone—breaks the hallucination bottleneck that causes sub-100B language models to fail at multimo
Generalizing frameworks for sentience beyond natural species
Criteria anchored to vertebrate neuroanatomy and verbal behavior — exemplified by the Smith & Boyd (1991) framework and the Turing Test — are structurally inadequate for the full space of possible sen