Blog
Guides, comparisons, and deep dives on privacy-first transcription, voice-to-text productivity, and the future of local AI.
How VoxBarβ’ built 15 engines on consumer hardware, tested 8 head-to-head under real-world conditions, and discovered what the benchmarks donβt tell you. From Whisper to Voxtral, Canary to Kyutai β we ranked them all so you donβt have to.
Read the full report βA core team of roughly 100 people shipped over 100 frontier AI models in two years β outpacing labs staffed by thousands. Now their leader has resigned, but their open-source legacy lives on.
Read the full story βWe tested INT8, NF4, and F16 configurations for streaming speech-to-text on consumer GPUs. Some paths looked promising until we pushed them. Here's the full breakdown of our quantization experiments and why F16 won.
Read the full report βMeet Pete Warden and Manjunath Kudlur β TensorFlow founding members who built Moonshine, the speech model that runs on anything.
Read more βMeet the engineers whose open-source speech models power three of VoxBar's six engines.
Read more βMeet the team who built the language model backbone that gives VoxBar AI its contextual intelligence.
Read more βMeet the brilliant Paris team behind the Vox Bar Pro engine. From three researchers to a $14 billion company.
Read more βThe French open-science lab that cracked real-time streaming speech-to-text.
Read more βHow OpenAI's release and SYSTRAN's optimization created the engine that started the offline transcription revolution.
Read more βYour voice contains biometric data. Here's what happens when you upload it to cloud services β and why local AI is the safer alternative.
Read more βDictate scripts, transcribe voiceovers, take notes during editing β privately, offline, and without monthly subscriptions.
Read more βA new startup is forging permanent AI models into physical chips. Here's how it could put VoxBar in your microphone.
Read more βIt's not a brain, a spy, or connected to the internet. It's compressed knowledge in a file. Here's what local AI actually is.
Read more βDon't have a $2,000 GPU? Hardware inference accelerators and open-source models are bringing frontier voice AI to emerging economies in a $50 USB stick.
Read more βCursor, Copilot, Claude Code β they all make you type. Overlay Mode lets you talk to any AI code editor instead.
Read more βAI understands word relationships, not individual letters. For many people, speaking is genuinely more accurate than typing.
Read more βMicrosoft put AI in Office but forgot the microphone. Overlay Mode lets you speak to Copilot, Claude in Excel, and more.
Read more βDownload and run AI locally in 5 minutes. No cloud, no API key, no subscription. A beginner's guide to Ollama.
Read more βRanked: the best AI models that run on consumer GPUs. From Llama to DeepSeek to Mistral β what to use and how much VRAM you need.
Read more βDeepSeek's Mixture of Experts breakthrough made AI models smaller, faster, and good enough to run at home. Here's what happened.
Read more βThe hottest AI models this month β from GLM-5's 744B reasoning engine to tiny on-device runners. What's worth downloading.
Read more βAI models are shrinking to fit your laptop β by design. Here's why on-device AI is the future of privacy and performance.
Read more β