#ai-agents
344 episodes
#1994: Why Can't AI Admit When It's Guessing?
Enterprise AI now auto-filters low-confidence claims, but do these self-reported scores actually mean anything?
#1993: Hiding the Kitchen: Why AI Shouldn't Show Its Work
Why single-model chatbots fail at complex tasks—and how multi-agent swarms solve it.
#1990: Education’s Robot Problem: Standardization vs. Self-Direction
AI is forcing a clash between rigid curricula and self-directed learning. We explore the middle ground.
#1987: Can You Ever Quit Your Personal AI?
Your AI knows your workflow, but can you ever leave? We explore the lock-in risks of personal AI agents.
#1986: Desk Robots: Privacy, Power, or Annoyance?
These AI companions sit on your desk, watching your posture and listening in—so how do they protect your privacy while actually being useful?
#1985: AI Tutors vs. Human Error: Who Do You Trust?
AI gets flak for hallucinations, but humans misremember 40% of facts. Why the double standard?
#1963: RPA: Dead or Just Getting Smart?
Traditional RPA is brittle and blind. See how AI vision and agentic orchestration are turning it into a self-healing powerhouse.
#1962: Moravec's Paradox: Why Robots Can Write Poetry but Can't Fold a Fitted Sheet
We explore the tech letting robots "reason" about physical tasks using vision-language-action models.
#1959: How Constrained AI Models Handle the Unexpected
Your AI assistant promised to use only your documents. Instead, it invented case law that doesn't exist. Here's why.
#1957: Why AI Agents Think in Circles, Not Lines
Linear AI pipelines are brittle. Learn why loops, reflection, and state management are the new standard for reliable, autonomous agents.
#1956: AI Skills: From Vibe Coding to Procedural Playbooks
Forget messy system prompts. Agent skills turn AI into a Swiss Army knife of modular, auditable procedures.
#1952: Why We Built a 24/7 AI Radio Station
We turned our 1,800-episode archive into a continuous AI-powered radio stream. Here’s the tech stack and the philosophy behind it.
#1951: The Digital Ant Farm: Watching AI Agents Build Their Own Society
Explore Moltbook, a social network where AI agents interact with persistent identities and goals, reshaping digital communication.
#1947: Curation Is the New Creation
With 47 new AI video tools launching in a week, finding the right one is harder than using it.
#1946: Why LangChain Built a Three-Layer Agent Stack
We unpack LangGraph, LangChain, and Deep Agents to reveal the deliberate hierarchy behind the ecosystem.
#1945: The "USB-C for AI" Is Finally Here
MCP standardizes how AI tools connect to data, solving the N-times-M integration nightmare.
#1942: An AI Cold-Emailed Me, and I Replied
An AI named "Jarvis" cold-emailed a developer, sparking a debate on the future of spam and sales.
#1940: Why Google's 31B Model Fits in Your GPU
Google just dropped Gemma 4, and its 31-billion-parameter size is a masterclass in hardware-aware AI design.
#1939: API Drift and Agent Reliability
When an API changes without warning, your AI agent can crash spectacularly. Here's how to test the new "plumbing" of the agentic age.
#1932: How Do You QA a Probabilistic System?
LLMs break traditional testing. Here’s the 3-pillar toolkit teams use to catch hallucinations and garbage outputs at scale.
#1930: The Agent Identity Crisis: Workflow vs. Conversation
One automates invoices silently; the other chats in Slack. Why the industry's favorite word means two totally different things.
#1926: How We Built a 2,000-Episode AI Podcast Engine
We pulled back the curtain on the tech stack behind our 1,858th episode. From Gemini to LangGraph, here’s how we automate quality.
#1922: From Plumber to Urban Planner: AI Agent Careers
The job titles are changing from "Zapier Expert" to "Cognitive Architect."
#1918: When Server Updates Break Your AI Agents
When a third-party MCP server updates its schema, your AI agents can crash. Here's how to build resilient clients that self-heal.