Nicholas Carlini - Black-hat LLMs | [un]prompted 2026

YouTube Video | unprompted | 5,310 words

Content Summary

TL;DR

Nicholas Carlini of Anthropic demonstrates that current LLMs can autonomously find and exploit zero-day vulnerabilities in critical software such as the Linux kernel and major web applications, using minimal scaffolding (essentially just Claude Code with a simple prompt). He argues that this capability has emerged only in the last 3-4 months, is improving exponentially, and represents the most significant shift in security since the invention of the internet. In his view, it demands urgent collective action so that defenders can keep pace during this critical transitional period.

ELI5

Imagine a really smart robot that can look at all the locks on all the doors in a building and find the ones that are broken, even ones that nobody knew were broken for 20 years! Now imagine that robot is getting smarter every few months. That's what's happening with AI and computer security, and we need lots of people to help fix all those broken locks before bad guys use the same robot.

Quick Actions

  • Begin contributing to defensive LLM security efforts immediately, whether at Anthropic, DeepMind, OpenAI, or independently
  • Assume your software has undiscovered vulnerabilities that LLMs can now find autonomously with minimal scaffolding
  • Invest in rewriting critical software components in memory-safe languages like Rust