#rag

82 episodes · Page 3 of 4

#1731: Why Deep Research Agents Are Being Forgotten

Specialized research agents outperform general orchestrators by 40-60% on verification tasks, yet developer hype is fading. Here's why.

ai-agentsragmodel-context-protocol

Mar 29

#1728: The AI Carpool: Emergent Collaboration Through Role-Playing

CAMEL AI lets two agents role-play to solve tasks autonomously. No complex code—just emergent teamwork.

ai-agentsprompt-engineeringrag

Mar 29

#1727: The Great Architectural Heist: LSP as AI's Universal Plumbing

Explore how the Language Server Protocol is being repurposed to integrate AI directly into code editors, unifying development workflows.

ai-agentssoftware-developmentrag

Mar 29

#1725: The Death of the Lonely Chatbot

Forget chatbots: AI orchestration is now the key to scaling intelligent agents in the enterprise.

ai-agentsdistributed-systemsrag

Mar 29

#1713: Why Native AI Search Grounding Still Fails

Native search grounding is expensive and flaky. Here’s why bolt-on tools still win for accurate, real-time AI answers.

ragai-agentslocal-ai

Mar 29

#1708: Why Your AI Agent Forgets Everything (And How to Fix It)

Learn how Letta's memory-first architecture solves the AI context bottleneck for long-term agents.

ai-agentsragcontext-window

Mar 29

#1700: Can LLMs Learn Continuously Without Forgetting?

We explore a new approach: micro-training updates every few days to keep AI knowledge fresh without constant web searches.

ragfine-tuningai-agents

Mar 28

#1666: The Agent Mesh: Shared Context That Changes Everything

Grok 4.20’s native multi-agent architecture cuts token costs by 75% and enables real-time cross-agent reasoning.

ai-agentstransformersrag

Mar 28

#1629: From DAGs to Loops: Why Agents Need Stateful Cycles

Stop building linear chains and start building cycles to create agents that can reason, self-correct, and maintain complex state.

ai-agentsragcontext-window

Mar 27

#1601: Cohere: The Switzerland of Enterprise AI

While others chase viral memes, Cohere is quietly building the secure, cloud-agnostic infrastructure powering the global enterprise.

ragspeech-recognitiondefense-technology

Mar 27

#1592: The Vector Debt Trap: Choosing Embeddings That Last

Stop treating embedding models like plumbing. Learn how to navigate vector debt, multimodal retrieval, and database configuration for RAG.

ragvector-databasesmultimodal-ai

Mar 26

#1565: Machine-Readable Safety: Markdown for AI Agents

Transform bloated government data into clean Markdown to power life-saving AI agents during emergencies.

ai-agentsragemergency-preparedness

Mar 23

#1482: The Hidden Cost of Choosing an Embedding Model

From Matryoshka models to multimodal search, discover how the fundamental units of AI memory are being optimized for efficiency and scale.

multimodal-aivector-databasesrag

Mar 15

#1212: The Postgres Vector Revolution: Killing the Sprawl

Is your tech stack a sprawling suburb of microservices? Discover why a 40-year-old database is winning the AI infrastructure war.

vector-databasesragarchitecture

Mar 12

#1123: When One Database Isn't Enough

Can Postgres 18 finally replace the data warehouse? We dive into data gravity, columnar storage, and the physics of scaling in the AI age.

architecturevector-databasesrag

Mar 11

#1103: The Kitchen War: When Theory Meets Messy Reality

Explore the mechanics of LLM context windows and attention, and witness what happens when technical debates collide with household chores.

large-language-modelsarchitecturerag

Mar 11

#1100: The Truth Conflict: Why AI Ignores the Facts You Give It

Discover why AI models ignore provided documents in favor of old training data and how to build a reliable "hierarchy of truth" for RAG systems.

raglarge-language-modelsprompt-engineering

Mar 6

#995: Democratizing Intelligence: From PDFs to Policy

How can AI transform dense government reports into actionable intelligence? Explore the physics of Iranian missiles and the future of OSINT.

iranballistic-missilesosintragmissile-defense

Mar 5

#959: The Infinite Content Problem: AI’s War on Truth

Explore how AI is scaling disinformation to an industrial level and what the "liar's dividend" means for the future of shared reality.

ai-agentsragsocial-engineering

Mar 5

#948: Can AI Search Survive the Fog of War and SEO Spam?

Explore how AI is moving from static models to real-time data and whether specialized search tools can survive the rise of the tech giants.

raggenerative-ailatencyanswer-engines

Feb 26

#869: Why Tiny Digital Savants Are Outperforming God-Models

Are massive AI models hitting a wall? Discover why the future belongs to lean, domain-specific "digital savants" and vertical pre-training.

small-language-modelsragfine-tuningai-orchestration2026

Feb 25

#846: Beyond the Vector: Building Long-Standing AI Memory

Stop relying on basic vector search. Discover how Graph RAG and RAPTOR are creating AI systems with true long-standing memory.

ragarchitectureknowledge-graphs

Feb 23

#810: The Agentic Interview: How AI Learns to Know You

Stop dumping data. Discover how agentic interviews are transforming AI from a passive listener into a proactive, structured partner.

ai-agentsragknowledge-graphs

Feb 23

#809: Beyond the Prompt: The Shift to AI Context Engineering

Is prompt engineering still magic, or just plumbing? Explore why the field is shifting toward context engineering and systematic evaluation.

prompt-engineeringarchitecturerag