AI Core

Vectors & Embeddings

Vector databases, RAG, semantic search

21 episodes

#2228: Tuning RAG: When Retrieval Helps vs. Hurts

How do you prevent retrieval from suppressing a model's reasoning? We diagnose our own pipeline's four control levers and multi-source fusion strat...

ragai-agentsprompt-engineering

#2213: Grading the News: Benchmarking RAG Search Tools

How do you rigorously evaluate whether Tavily or Exa retrieves better results for breaking news? A formal benchmark beats the vibe check.

ragbenchmarkshallucinations

#2206: What Actually Works in AI Memory

Most AI memory systems are just vector databases with similarity search. We break down what mem0, Zep, and Letta are actually doing—and why benchma...

ai-memoryvector-databasesknowledge-graphs

#2181: When RAG Becomes an Agent

RAG in chatbots is simple retrieval. RAG in agents is a multi-step decision loop. Here's what actually changes.

ragai-agentsai-orchestration

#2139: AI Wargame Memory: Beyond the Context Window

Why simply extending context windows fails in multi-agent simulations, and how layered memory architectures preserve strategic fidelity.

ai-agentsai-memoryvector-databases

#2010: Building Better AI Memory Systems

We obsess over AI inputs but treat outputs like Snapchat messages. Here's why that's a massive blind spot.

ai-agentsragdata-storage

#2008: Needle-in-a-Haystack Testing for LLMs

New AI models claim to be genius-level, but can they actually find a specific fact in a massive document?

ragai-agentsopen-source

#1959: How Constrained AI Models Handle the Unexpected

Your AI assistant promised to only use your documents. Instead, it invented a case law that doesn't exist. Here's why.

ai-agentsraghallucinations

#1925: The Plumbing That Keeps Science From Collapsing

Half of all links in academic papers are dead. Here’s the plumbing that keeps knowledge from vanishing.

digital-forensicsdata-redundancyknowledge-management

#1914: Google Invented RAG's Secret Sauce

Before LLMs, Google solved the "hallucination" problem with a two-stage trick that's making a huge comeback.

raghallucinationsre-ranking

#1910: Our Podcast Is Now a Permanent Research Artifact

Why we're uploading every episode to CERN's Zenodo archive, giving our AI experiments a permanent DOI and a life beyond streaming platforms.

open-sourcedata-storagedigital-forensics

#1849: The Forever Dungeon Master: SillyTavern's Secret Lorebooks

Forget simple chatbots—this is how roleplayers taught AI to remember entire worlds, from 90s MUDs to just-in-time lore delivery.

ai-agentsvector-databaseslocal-ai

#1838: Tuning Search Without Losing Your Mind

Modern search bars are AI decision engines. Here's how small teams can tune fuzzy matching, semantic search, and reranking without breaking everyth...

ragvector-databasesai-reasoning

#1834: Building Portable Personal Context for AI

Why your AI remembers your coffee order but forgets your son’s name—and how to build a portable, federated memory layer you actually own.

ai-memoryvector-databasesmodel-context-protocol

#1794: RAG Is Cheaper Than You Think (Until It’s Not)

From a $1 embedding bill to a $10k/month vector database bill, here’s the real math behind RAG in 2026.

ragvector-databasescloud-computing

#1792: Google's Native Multimodal Embedding Kills the Fusion Layer

Google’s new embedding model maps text, images, audio, and video into a single vector space—cutting latency by 70%.

multimodal-airagai-models

#1784: Context1: The Retrieval Coprocessor

Chroma's new 20B model acts as a specialized "scout" for your LLM, replacing slow, static RAG with multi-step, agentic search.

ragai-agentslatency

#1779: AI Memory Is a Mess: Files, Vectors, or Cloud?

Why your AI forgets your instructions and what the battle over portable memory means for the future of agents.

ai-memoryvector-databaseslocal-ai

#1765: The Agentic Internet: A Clean Web for Machines

We explore the tools building a parallel, machine-readable web—from SearXNG to Tavily.

ai-agentsragopen-source

#1764: Vector Databases as a Single File

How to give AI agents instant memory of your entire project—without cloud costs or complex infrastructure.

vector-databasesraglocal-ai

#1713: Why Native AI Search Grounding Still Fails

Native search grounding is expensive and flaky. Here’s why bolt-on tools still win for accurate, real-time AI answers.

ragai-agentslocal-ai