Vectors & Embeddings

Vector databases, RAG, semantic search

35 episodes

#3751: Source-Restricted vs. Open Retrieval: How to Lock Down Your LLM

When should an LLM be locked to specific documents, and when should it search the web? A practical framework for grounding decisions.

ragai-safetylegal-technology

Jun 18

#3673: Knowledge Graphs vs SQL: How Custom Relationships Change Retrieval

Why naming relationships (not just connecting data) transforms how you retrieve information.

knowledge-graphsgraph-databasesvector-databases

May 17

#2883: Correlation Beyond Pearson: 5 Techniques You Need

Pearson, Spearman, Kendall, partial, distance correlation — when to use each one and why most people stop too soon.

data-integrityinterpretabilitycorrelation-analysis

May 13

#2810: Every Catalog Is an Argument

From clay spine labels at Ebla to the Pinakes of Alexandria — how organizing knowledge shaped civilization.

taxonomyknowledge-managementhistorical-linguistics

May 7

#2682: Live Retrieval vs. RAG: What an Agent Actually Does

Does every AI conversation create a tiny vector store? We unpack the real tradeoffs between live document fetching and pre-indexed RAG.

ragai-agentsvector-databases

May 6

#2676: Vector Database Schema Design for AI Memory Layers

Stop dumping vectors blindly. Design metadata schemas and namespaces for retrieval that actually works at scale.

vector-databasesragai-memory

May 6

#2673: The Embedding Coupling Problem: Editing Vector Stores

Can you edit or delete individual chunks in Pinecone? And can you actually back up a vector index? Yes—but with critical caveats.

vector-databasesragai-agents

May 5

#2639: The Hidden Layer That Makes Search Work

Why your search results miss the mark — and how cross-encoders fix it.

ragsearchinformation-retrieval

Apr 26

#2469: Embedding Model Deprecation: RAG's Silent Killer

When OpenAI retires an embedding model, your RAG pipeline breaks silently. Here’s how to fix it.

ragmodel-context-protocolvector-databases

Apr 26

#2466: The Hidden Trap of Embedding Model Lock-In

What happens when your vector database works great — until your embedding model gets deprecated and your vectors become useless.

ragopen-sourceembedding-models

Apr 26

#2465: JSON-L vs Parquet: When Each Format Wins

How far can JSON-L scale before it breaks? And why does Parquet dominate for millions of rows?

data-storagedata-integrityjsonl

Apr 26

#2458: Can Graph Databases Go Mainstream?

Graph databases are powerful but niche. Will they ever power mainstream CRMs and ERPs?

graph-databasesai-agentsvector-databases

Apr 21

#2368: The Multi-Stage Pipeline Behind Netflix's Recommendations

Unpacking the multi-stage AI pipeline behind Netflix, Spotify, and Amazon’s "you might also like" suggestions—from candidate generation to real-tim...

ai-modelsdata-storageai-training

Apr 17

#2271: Vector Search in a Single File

What if you could do vector search with just SQLite? We explore sqlite-vec, the extension that adds embeddings to the world's simplest database, an...

vector-databasesedge-computingdata-storage

Apr 15

#2228: Tuning RAG: When Retrieval Helps vs. Hurts

How do you prevent retrieval from suppressing a model's reasoning? We diagnose our own pipeline's four control levers and multi-source fusion strat...

ragai-agentsprompt-engineering

Apr 14

#2213: When Ground Truth Moves Hourly

How do you rigorously evaluate whether Tavily or Exa retrieves better results for breaking news? A formal benchmark beats the vibe check.

ragbenchmarkshallucinations

Apr 13

#2206: What Actually Works in AI Memory

Most AI memory systems are just vector databases with similarity search. We break down what mem0, Zep, and Letta are actually doing—and why benchma...

ai-memoryvector-databasesknowledge-graphs

Apr 12

#2181: When RAG Becomes an Agent

RAG in chatbots is simple retrieval. RAG in agents is a multi-step decision loop. Here's what actually changes.

ragai-agentsai-orchestration

Apr 9

#2139: AI Wargame Memory: Beyond the Context Window

Why simply extending context windows fails in multi-agent simulations, and how layered memory architectures preserve strategic fidelity.

ai-agentsai-memoryvector-databases

Apr 4

#2010: Building Better AI Memory Systems

We obsess over AI inputs but treat outputs like Snapchat messages. Here's why that's a massive blind spot.

ai-agentsragdata-storage

Apr 4

#2008: Needle-in-a-Haystack Testing for LLMs

New AI models claim to be genius-level, but can they actually find a specific fact in a massive document?

ragai-agentsopen-source

Apr 3

#1959: How Constrained AI Models Handle the Unexpected

Your AI assistant promised to only use your documents. Instead, it invented a case law that doesn't exist. Here's why.

ai-agentsraghallucinations

Apr 2

#1925: The Plumbing That Keeps Science From Collapsing

Half of all links in academic papers are dead. Here’s the plumbing that keeps knowledge from vanishing.

digital-forensicsdata-redundancyknowledge-management

Apr 2

#1914: Google Invented RAG's Secret Sauce

Before LLMs, Google solved the "hallucination" problem with a two-stage trick that's making a huge comeback.

raghallucinationsre-ranking

Apr 2

#1910: Our Podcast Is Now a Permanent Research Artifact

Why we're uploading every episode to CERN's Zenodo archive, giving our AI experiments a permanent DOI and a life beyond streaming platforms.

open-sourcedata-storagedigital-forensics

Mar 31

#1849: When Forum Etiquette Becomes Prompt Engineering

Forget simple chatbots—this is how roleplayers taught AI to remember entire worlds, from 90s MUDs to just-in-time lore delivery.

ai-agentsvector-databaseslocal-ai

Mar 31

#1838: Tuning Search Without Losing Your Mind

Modern search bars are AI decision engines. Here's how small teams can tune fuzzy matching, semantic search, and reranking without breaking everyth...

ragvector-databasesai-reasoning

Mar 31

#1834: Owning Your AI Memory: The Data Exit Strategy

Why your AI remembers your coffee order but forgets your son’s name—and how to build a portable, federated memory layer you actually own.

ai-memoryvector-databasesmodel-context-protocol

Mar 31

#1794: RAG Is Cheaper Than You Think (Until It’s Not)

From a $1 embedding bill to a $10k/month vector database bill, here’s the real math behind RAG in 2026.

ragvector-databasescloud-computing

Mar 31

#1792: Google's Native Multimodal Embedding Kills the Fusion Layer

Google’s new embedding model maps text, images, audio, and video into a single vector space—cutting latency by 70%.

multimodal-airagai-models

Mar 30

#1784: Context1: The Retrieval Coprocessor

Chroma's new 20B model acts as a specialized "scout" for your LLM, replacing slow, static RAG with multi-step, agentic search.

ragai-agentslatency

Mar 30

#1779: AI Memory Is a Mess: Files, Vectors, or Cloud?

Why your AI forgets your instructions and what the battle over portable memory means for the future of agents.

ai-memoryvector-databaseslocal-ai

Mar 29

#1765: The Agentic Internet: A Clean Web for Machines

We explore the tools building a parallel, machine-readable web—from SearXNG to Tavily.

ai-agentsragopen-source

Mar 29

#1764: Your Repo as a Knowledge Base

How to give AI agents instant memory of your entire project—without cloud costs or complex infrastructure.

vector-databasesraglocal-ai

Mar 29

#1713: Why Native AI Search Grounding Still Fails

Native search grounding is expensive and flaky. Here’s why bolt-on tools still win for accurate, real-time AI answers.

ragai-agentslocal-ai