
Richard Sutton – Father of RL thinks LLMs are a dead end
Content Summary
InterviewRichard Sutton – Father of RL thinks LLMs are a dead end • Dwarkesh Patel
TL;DR
Richard Sutton, the Turing Award-winning father of reinforcement learning, argues that LLMs are fundamentally limited because they learn from human-generated text rather than from direct experience with the world, lack genuine goals, and cannot learn continually. He advocates for an experiential paradigm where agents learn through trial-and-error interaction with their environment—sensing, acting, and receiving rewards—which he believes will ultimately supersede LLMs as another instance of the bitter lesson, where scalable general methods defeat approaches built on human knowledge.
ELI5
Imagine you have a parrot that can repeat everything people say really well. Richard Sutton says that's what big AI models are like—they're just copying what people say! But a truly smart pet, like a dog, learns by trying things itself: it figures out that if it sits, it gets a treat. Sutton thinks AI needs to be more like the dog—learning by doing things and seeing what happens—not like the parrot just copying words.
Top Concepts
Keywords
Quick Actions
- !Evaluate whether your AI system has a substantive goal that affects the external world, not just prediction accuracy
- !Invest in building continual learning capabilities into AI agents rather than relying solely on pre-training
- !Design agents with four core components: policy, value function, perception/state representation, and world transition model
Want to analyze your own content?
Extract insights from YouTube videos, PDFs, and web articles. Free to start.
Try Knowmler Free