#ai-agents
344 episodes
#2142: The Nervous System of Multi-Agent Systems
We break down the plumbing that lets a parent agent know exactly when a subagent finishes, from message passing to lifecycle events.
#2141: Choosing Your Durable Execution Platform
Why building AI agents means managing infrastructure. We explore durable execution backends like Temporal and AWS Step Functions.
#2139: AI Wargame Memory: Beyond the Context Window
Why simply extending context windows fails in multi-agent simulations, and how layered memory architectures preserve strategic fidelity.
#2137: Wargaming's Methodology, Not Magic
Most AI wargames are just expensive role-play. Here's the professional methodology they're missing.
#2136: The Brutal Problem of AI Wargame Evaluation
Most AI wargame simulations skip evaluation entirely or rely on token expert reviews. This is the field's biggest credibility problem.
#2135: Is Your AI Wargame Signal or Noise?
Monte Carlo methods promise statistical rigor for AI wargaming, but the line between genuine insight and sampling noise is thinner than you think.
#2134: The Fog-of-War Problem in AI Wargaming
Why shared AI brains make secret-keeping a nightmare, and the four architectural patterns researchers use to fix it.
#2133: Engineering Geopolitical Personas: Beyond Caricatures
How to build LLMs that simulate state actors with strategic fidelity, not just surface mimicry.
#2132: The Referee's Dilemma: Epistemic Containment in LLM Simulations
Why do AI war games need a news blackout? We dissect the firewall that keeps LLM actors from cheating with real-world data.
#2131: The CIA Is on GitHub
In-Q-Tel is on GitHub. Explore the IC's strategic investment arm and its use of open-source AI for wargaming.
#2129: Shifting Left on Hallucinations
Stop hoping your AI doesn't lie. We explore the shift to deterministic guardrails, specialized judge models, and the tools making agents reliable.
#2125: Why Agentic Chunking Beats One-Shot Generation
A single prompt can't write a 30-minute script. Here's the agentic chunking method that fixes coherence.
#2114: 2026 ERP: From Filing Cabinet to Autonomous Core
In 2026, ERP systems have evolved from digital filing cabinets into autonomous, AI-driven cores that predict and execute business decisions in real...
#2113: Goldfish vs Elephant: The Stateful Agent Dilemma
Stateless agents are cheap and fast, but stateful ones remember your window seat. Which architecture wins?
#2111: From Bricklayer to Foreman: AI's Dev Role Shift
AI frameworks are exploding while languages stay stable. Learn why core dev knowledge is shifting from syntax to systems thinking.
#2110: Tuning AI Personality: Beyond Sycophancy
AI models swing between obsequious flattery and cold dismissal. Here's why that happens and how to fix it.
#2109: AI Is Forcing You to Use React
AI tools are reshaping developer stacks, favoring React and Postgres over niche frameworks.
#2103: AI Firewalls: Spotting Bombs on an Encrypted Conveyor Belt
With 95% of web traffic encrypted, firewalls can't read packets. Here's how AI analyzes metadata to detect threats without decryption.
#2102: Why Don't You Notice AI Security Delays?
Multi-layer security checks add latency, but modern CLIs hide it under 100ms using parallelization and speculation.
#2089: Open-Source vs. Military ATR: The Drone Recognition Gap
A public GitHub model spotted by a listener reveals the massive gap between hobbyist AI and lethal military drone detection systems.
#2075: AI Agents for Israel: Hyper-Local Skills in Action
How reusable AI "skills" are solving real Israeli problems—from shelter navigation to tax compliance.
#2074: Generative Social Science: When AI Agents Develop Theory of Mind
See how a new framework models 10,000 virtual citizens to test policies before spending a dime.
#2071: Git Can't Handle AI Agents—Yet
Three AI agents in one repo is pure chaos. Here's why Git's design causes collisions—and how worktrees and locks can save your sanity.
#2069: The Vibe Coding Trap: Why Your Agent Skills Keep Breaking
Stop guessing at the agentskills.io spec. Learn the exact YAML fields, directory structure, and authoring patterns to make Claude Code skills that ...