![Nicholas Carlini - Black-hat LLMs | [un]prompted 2026](https://img.youtube.com/vi/1sd26pWhfmg/hqdefault.jpg)
Nicholas Carlini - Black-hat LLMs | [un]prompted 2026
Content Summary
Discussion & OpinionNicholas Carlini - Black-hat LLMs | [un]prompted 2026 • unprompted
TL;DR
Nicholas Carlini from Anthropic demonstrates that current LLMs can autonomously find and exploit zero-day vulnerabilities in critical software like the Linux kernel and major web applications, using minimal scaffolding (essentially just Claude Code with a simple prompt). He argues this capability has emerged only in the last 3-4 months, is improving exponentially, and represents the most significant shift in security since the invention of the internet — requiring urgent collective action to ensure defenders can keep pace during this critical transitionary period.
ELI5
Imagine a really smart robot that can look at all the locks on all the doors in a building and find the ones that are broken — even ones that nobody knew were broken for 20 years! Now imagine that robot is getting smarter every few months. That's what's happening with AI and computer security, and we need lots of people to help fix all those broken locks before bad guys use the same robot.
Top Concepts
Keywords
Quick Actions
- !Begin contributing to defensive LLM security efforts immediately - at Anthropic, DeepMind, OpenAI, or independently
- !Assume your software has undiscovered vulnerabilities that LLMs can now find autonomously with minimal scaffolding
- !Invest in rewriting critical software components in memory-safe languages like Rust
Want to analyze your own content?
Extract insights from YouTube videos, PDFs, and web articles. Free to start.
Try Knowmler Free