Audio Processing
Noise removal, audio quality, microphones, VAD
12 episodes
#3446: Where to Clip a Speaker for the Best Sound
Tiny placement changes dramatically alter sound. Learn the physics of where to clip your speaker for the best audio.
#2886: How Acoustic Cameras Catch Honking Drivers
Can an acoustic camera pinpoint one honk in a traffic jam? The tech is real, and fines are being issued.
#2602: Letting Non-Experts Direct Audio Tools Through Conversation
How to use AI for podcast mastering — and why agentic AI works better for small tasks than big promises.
#2590: The Uncanny Valley of Clean Speech
How transformer models distinguish "um" from meaningful speech — and why removing too much makes you sound like a robot.
#2582: What Your Browser Does to Mic Audio Before It Reaches Your Server
getUserMedia returns audio, but not raw audio. Here's what browsers actually do to your mic feed before it hits your server.
#2563: How Audio Fingerprinting Actually Works
Spectrogram peaks, constellation maps, and hash matching — the elegant mechanics behind identifying any song in seconds.
#2543: Why Base64 Adds 33% Overhead (And Why You Still Need It)
Base64 isn’t compression — it’s a safe transport encoding. Here’s how it works with audio APIs and where its limits are.
#2486: Why Noise Reduction Can Ruin Transcription Accuracy
Cleaning audio before transcription can increase errors by up to 46%. Here's the right approach for your voice app.
#2337: When Diarization Fails Silently
Discover how PyAnnote and other tools tackle the critical task of identifying "who spoke when" in audio—and why it’s harder than it sounds.
#2288: The Invisible Gatekeeper of Voice Tech
How voice activity detection shapes every step of the voice tech pipeline, and why it’s harder than it seems.
#2183: Making Voice Agents Feel Natural
Turn-taking, interruptions, and latency are destroying voice AI UX—and the fixes are deeply technical. Here's what's actually happening underneath.
#1800: Hacking the Brain's Alarm System
Why some sounds make your skin crawl: the science of emergency alerts.