#speech-to-text
7 episodes
#69: Why the Gooseneck Mic Beats Studio Gear for AI
The gooseneck mic: a humble hero with surprising AI power. Discover its secret to crystal-clear speech-to-text accuracy!
#33: When AI Decides to Listen
Ever wonder how your AI knows you're talking? We're diving deep into VAD, the unseen magic behind AI's ears.
#29: Will Multimodal Audio Replace Speech-to-Text?
Is multimodal audio the future? We explore if AI can truly displace traditional speech-to-text for a screen-free world.
#28: Your AI, Evolving: Beyond the Static Snapshot
Is your AI an "old suit" that no longer fits? We explore evolving AI that learns and adapts with you.
#7: Building Custom ASR Tools
Ever wondered how to build your own ASR tools from scratch? Discover the why and how in this episode!
#10: How ASR Went From Frustration To ... Whisper Magic
Speech to text: from frustrating to fantastic. Uncover the magic behind its rapid rise and connection to the AI boom!
#3: Safetensors or something else: STT inference formats explained
Unpacking ASR weight formats: Safetensors and beyond. Tune in to understand the distinctions.