Quantization & Optimization
Model compression, efficiency, and small language models (SLMs)
6 episodes
#2377: Is Geopolitical Neutrality a Sustainable AI Strategy?
How DeepSeek carved a niche with efficiency, neutrality, and innovative dialogue handling — and what it means for AI's future.
#2041: The "MPEG Moment" for AI: Llamafile & Native Models
Why are we squeezing massive cloud models onto desktops? Meet the "native" AI revolution.
#2027: The Missing Photoshop for Words
Why is editing text with AI so clunky? We explore the "TITO" paradigm—using small, local models for fast, private text transformation.
#2017: The Art of Squeezing AI Models onto Your GPU
Those cryptic letters on Hugging Face actually tell you how much brain power you trade for speed.
#1943: The Invisible Math Shrinking AI Models
LZMA, Zstandard, and Brotli are shrinking massive AI models, but how do they actually work?
#1705: Microsoft's Phi: The Small Model Bet for Agentic AI
Microsoft is pushing small language models like Phi for agentic AI. Here’s why that strategy matters for speed, cost, and edge computing.