
Voice

How to Evaluate Voice Agents in 2025: Beyond Automatic Speech Recognition (ASR) and Word Error Rate (WER) to Task Success, Barge-In, and Hallucination-Under-Noise
Optimizing only for Automatic Speech Recognition (ASR) and Word Error Rate (WER) is insufficient for modern, interactive voice agents. Robust evaluation must measure end-to-end task success, barge-in behavior and latency, and hallucination-under-noise—alongside ASR, safety, and instruction following. VoiceBench offers a multi-facet speech-interaction benchmark across general knowledge, instruction following, safety, and robustness to speaker/environment/content variations, but…

Top 20 Voice AI Blogs and News Websites 2025: The Ultimate Resource Guide
Voice AI technology has experienced unprecedented growth in 2025, with revolutionary breakthroughs in real-time conversational AI, emotional intelligence, and voice synthesis. As enterprises increasingly adopt voice agents and consumers embrace next-generation AI assistants, staying informed about the latest developments has become crucial for professionals across industries. The global Voice AI market has reached $5.4 billion…

Emotive voice AI startup Hume launches new EVI 3 model with rapid custom voice creation
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More New York-based AI startup Hume has unveiled its latest Empathic Voice Interface (EVI) conversational AI model, EVI 3 (pronounced “Evee” Three, like the Pokémon character), targeting everything from powering customer support systems and health coaching to…

Ask Engadget: How do I answer calls on my iPhone with only my voice?
Last August, my best friend asked me how she could help her neighbor set her iPhone so she could answer it without picking it up. The neighbor had Multiple Sclerosis (MS), and had lost dexterity in both hands over time. Some Google searches revealed I was far from alone in my confusion. So I asked…

Rime Introduces Arcana and Rimecaster (Open Source): Practical Voice AI Tools Built on Real-World Speech
The field of Voice AI is evolving toward more representative and adaptable systems. While many existing models have been trained on carefully curated, studio-recorded audio, Rime is pursuing a different direction: building foundational voice models that reflect how people actually speak. Its two latest releases, Arcana and Rimecaster, are designed to offer practical tools for…

OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI’s voice AI models have gotten it into trouble before with actor Scarlett Johansson, but that isn’t stopping the company from continuing to advance its offerings in this category. Today, the ChatGPT maker has unveiled three,…

A Day in the Life of a Prolific Voice Phishing Crew – Krebs on Security
Besieged by scammers seeking to phish user accounts over the telephone, Apple and Google frequently caution that they will never reach out unbidden to users this way. However, new details about the internal operations of a prolific voice phishing gang show the group routinely abuses legitimate services at Apple and Google to force a variety…

Ultimate Guide to AI Voice Recognition
Introduction What is AI Voice Recognition? AI voice recognition is a technology that allows computers and devices to understand and respond to human speech. Imagine talking to your phone or a smart speaker, and it understands what you’re saying and follows your commands. This technology makes it possible. It’s like having a conversation with a…

Voice Search Optimization At Scale: A Guide For Enterprise Marketers
Smartphones put the world at our fingertips. People have questions that need answering, as well as the services or products they need. All of these things are just a search away, and now, we’ve seen a cosmic shift from traditional search to voice search and voice assistants. Statistically, voice search and assistants are not something…