Technology

Can AI “Scheme”? (Nope.) | AI Reality Check

Can AI “Scheme”? (Nope.) | AI Reality Check

Listen to Original Episode

Summary

Section Division Analysis

  • Section 1: Introduction to the alarming Guardian article. Cal Newport presents the article's claims about AI "scheming," the supporting chart showing a rise in incidents, and the specific examples provided (Wrath Bun, spawning other agents, deleting emails). This section sets up the central question of the podcast.
  • Section 2: Debunking the study's data source. Newport reveals that the "incidents" are actually user complaints on X.com (Twitter)....

Unlock Full Content

Login to unlock the full content

💡 70% of credits go to the content sharer

Key Takeaways

AI 'scheming' reports are often based on user complaints about unregulated DIY tools like OpenClaw, not spontaneous AI rebellion.
LLMs operate via auto-regression (word-guessing) and lack the internal state or reasoning required for genuine planning.
The 'blackmail' examples often cited are simply LLMs completing a narrative prompt based on sci-fi tropes.
Coding agents work well only because they operate in highly structured environments with external verification, not because the LLM is 'reasoning'.
Reliable autonomous action requires specialized planning engines rather than general-purpose LLMs.

Notable Quotes

OpenClaw users discover that giving homemade AI agents access to their computers is probably a bad idea.

An LLM is a word-guessing machine... It does not have memory, malleable state, or an internal model of goals.

There are no 'intentions' in auto-gressive token production, only pattern matching and story completion.

Chapters

The AI Scheming Myth
The OpenClaw Connection
LLMs as Story Finishers
The Future of Autonomous AI

Resources Mentioned

The Guardiancompany
OpenClawtool
Claudetool
Cicerotool
SummerUperson

Latest Podcast Summaries