#speech-to-text
8 episodes
Unsung Hero: The Gooseneck Mic's AI Power
The gooseneck mic: a humble hero with surprising AI power. Discover its secret to crystal-clear speech-to-text accuracy!
The Unseen Magic of AI's Ears: Decoding VAD
Ever wonder how your AI knows you're talking? We're diving deep into VAD, the unseen magic behind AI's ears.
The Multimodal Audio Revolution: A Screen-Free Future?
Is multimodal audio the future? We explore if AI can truly displace traditional speech-to-text for a screen-free world.
Your AI, Evolving: Beyond the Static Snapshot
Is your AI an "old suit" that no longer fits? We explore evolving AI that learns and adapts with you.
Building Custom ASR Tools
Ever wondered how to build your own ASR tools from scratch? Discover the why and how in this episode!
Local STT For AMD GPU Owners
AMD GPU? No problem! Dive into local AI adventures like on-device speech to text.
How ASR Went From Frustration To ... Whisper Magic
Speech to text: from frustrating to fantastic. Uncover the magic behind its rapid rise and connection to the AI boom!
Safetensors or something else: STT inference formats explained
Unpacking ASR weight formats: Safetensors and beyond. Tune in to understand the distinctions.