#quantization
13 episodes
#2041: The "MPEG Moment" for AI: Llamafile & Native Models
Why are we squeezing massive cloud models onto desktops? Meet the "native" AI revolution.
#2017: That Q4_K_M Is Not a Cat Sneeze
Those cryptic letters on Hugging Face actually spell out how much brain power you trade for speed.
#1559: Dark Knowledge: The Art of AI Model Distillation
Discover how model distillation transfers "dark knowledge" from massive AI giants into tiny, efficient models that live in your pocket.
#1479: The Speed of Thought: Inside the New Era of Inference
The war for model size is over. Explore the engineering breakthroughs making massive AI models faster than human thought.
#1099: Digital Recalls: Why Your AI Is Losing Its Edge
Is your AI getting lazier? Explore the "digital recall" and why the world’s most advanced models are secretly taking steps backward.
#1094: The CPU-First Era: Why AI is Moving Back to the Processor
Is the GPU's reign over? Discover how modern CPUs and clever optimization are bringing powerful AI models to the hardware you already own.
#1085: The Tokenization Lie: How AI Actually Processes Media
Think 1,000 tokens equal 750 words? For audio and video, that rule is a lie. Discover the hidden math behind multimodal AI.
#992: Beyond the Digital Sandwich: The Future of Voice AI
Is speech recognition dead? Explore how multimodal models are replacing the "digital sandwich" with true intent-based reasoning.
#477: Can Your Phone Actually Think Without the Cloud?
Can your phone finally think for itself? Explore the hardware and software breakthroughs bringing agentic AI to the palm of your hand.
#120: Silencing the Siren: Real-Time AI Noise Reduction
How do phones remove sirens and crying babies in real time? Explore the neural networks and hardware making crystal-clear audio possible.
#108: The Mystery of Model Rot: Why Your AI Code Assistant Changes
Why do AI models lose their edge over time? Herman and Corn explore the "home team advantage" and why telling your AI to "do better" actually works.
#107: The $5.5 Million Breakthrough: DeepSeek’s AI Disruption
Discover how DeepSeek-V3 is disrupting the AI market with massive cost savings and technical innovations like Multi-Head Latent Attention.
#41: Local AI Unlocked: The Power of Quantization
Unlock powerful AI on your device! We demystify quantization, the ingenious trick making local AI a reality.