AI Core

Fundamentals of AI models, architecture, and how they work

221 episodes · Page 7 of 10

#2016: Andrej Karpathy: The Bob Ross of Deep Learning

Why the most influential AI mind prefers a blank text file to proprietary black boxes.

ai-trainingopen-source-aiai-reasoning

#2010: Building Better AI Memory Systems

We obsess over AI inputs but treat outputs like Snapchat messages. Here's why that's a massive blind spot.

ai-agentsragdata-storage

#2008: Needle-in-a-Haystack Testing for LLMs

New AI models claim to be genius-level, but can they actually find a specific fact in a massive document?

ragai-agentsopen-source

#2007: AI Grading AI: The Snake Eating Its Tail

We asked an AI to write this script. Then we asked another AI to grade it. Here’s what happens when the judges have biases.

llm-as-a-judgehallucinationsai-ethics

#2006: How Do You Measure an LLM's "Soul"?

Traditional benchmarks can't measure tone or empathy. Here's how to evaluate if an AI model truly "gets it right."

llm-as-a-judgeai-ethicsai-safety

#2005: Beyond Vibes: The Hard Science of LLM Evaluation

Running the same LLM on different GPUs can produce different results. Here’s why that happens and how to test for it.

llm-as-a-judgeragcontext-window

#1994: Why Can't AI Admit When It's Guessing?

Enterprise AI now auto-filters low-confidence claims, but do these self-reported scores actually mean anything?

ai-agentsai-safetyrag

#1992: The Sovereign Compute Shift: Owning vs. Renting AI Iron

Israel is building a sovereign AI supercomputer with 4,000 Nvidia B200 GPUs to keep startups local.

gpu-accelerationnational-securityinfrastructure

#1991: Why 20 Clean Qubits Beat 1000 Noisy Ones

Israel just unveiled its first 20-qubit superconducting quantum computer, and it's not about size—it's about precision and control.

israelaerospace-engineeringmaterial-science

#1985: AI Tutors vs. Human Error: Who Do You Trust?

AI gets flak for hallucinations, but humans misremember 40% of facts. Why the double standard?

ai-agentsai-safetyreliability

#1979: When Marketing Swallows the Tech

Is AI the same as Machine Learning? We break down the nested hierarchy of artificial intelligence, from symbolic logic to neural networks.

ai-historyai-modelssymbolic-ai

#1962: Moravec's Paradox: Why Robots Can Write Poetry but Can't Fold a Fitted Sheet

We explore the tech letting robots "reason" about physical tasks using vision-language-action models.

ai-agentscomputer-visionreasoning-models

#1959: How Constrained AI Models Handle the Unexpected

Your AI assistant promised to only use your documents. Instead, it invented a case law that doesn't exist. Here's why.

ai-agentsraghallucinations

#1957: Why AI Agents Think in Circles, Not Lines

Linear AI pipelines are brittle. Learn why loops, reflection, and state management are the new standard for reliable, autonomous agents.

ai-agentsprompt-injectionai-safety

#1946: Why LangChain Built a Three-Layer Agent Stack

We unpack LangGraph, LangChain, and Deep Agents to reveal the deliberate hierarchy behind the ecosystem.

ai-agentssoftware-developmentdistributed-systems

#1943: The Invisible Math Shrinking AI Models

LZMA, Zstandard, and Brotli are shrinking massive AI models, but how do they actually work?

data-integritysoftware-developmenthigh-performance-computing

#1940: Why Google's 31B Model Fits in Your GPU

Google just dropped Gemma four, and its 31-billion-parameter size is a masterclass in hardware-aware AI design.

open-source-aigpu-accelerationai-agents

#1938: JSON-to-SQL Type Mapping: A Practical Guide

Mapping JSON to SQL isn't as simple as it looks. Discover the hidden traps in data types that can cause performance hits and data corruption.

data-integritysoftware-developmentdistributed-systems

#1932: How Do You QA a Probabilistic System?

LLMs break traditional testing. Here’s the 3-pillar toolkit teams use to catch hallucinations and garbage outputs at scale.

ai-agentsai-safetyhallucinations

#1931: Where Your AI Pipeline Actually Dies

Why do AI pipelines crash? It’s not the models—it’s the plumbing. We break down how to manage data between stages.

distributed-systemsdata-redundancyhigh-availability

#1929: From Vibe Checks to Model Metrics

We stopped "vibe-checking" our AI scripts and built a science fair for models. Here's how we grade them.

ai-modelsprompt-engineeringai-ethics

#1927: Workers vs. Servers: The 2026 Compute Showdown

Is the persistent server dead? We compare Cloudflare Workers, GitHub Actions, and VPS options for modern app architecture.

edge-computingserverless-gpulatency

#1925: The Plumbing That Keeps Science From Collapsing

Half of all links in academic papers are dead. Here’s the plumbing that keeps knowledge from vanishing.

digital-forensicsdata-redundancyknowledge-management

#1914: Google Invented RAG's Secret Sauce

Before LLMs, Google solved the "hallucination" problem with a two-stage trick that's making a huge comeback.

raghallucinationsre-ranking