Quick News Spot

Exclusive: Deepgram launches voice agent API that brings AI conversations to life - SiliconANGLE


Exclusive: Deepgram launches voice agent API that brings AI conversations to life - SiliconANGLE

Exclusive: Deepgram launches voice agent API that brings AI conversations to life

Deepgram Inc., the developer of a speech recognition engine that provides its service via application programming interfaces, today announced a powerful addition to its platform that enables natural-sounding conversations between humans and artificial intelligence agents at large scale in real time.

Using speech recognition and voice synthesis AI models, Deepgram's voice agent systems ensure human-like responsiveness by making. In this release, the company is offering a system that packages all the pieces together under a single API.

All users have to do is set up a prompt and tell it what they want it to do such as tell it what they want it to do and the system manages the rest. In the past, using Deepgram developers would have had to connect together multiple parts of the system such as hooking in a large language model provider, the company's voice-to-text speech recognition model and the speech synthesis model.

"We have a big shift that's happening in the world right now," Scott Stephenson, co-founder and chief executive of Deepgram, said in an interview with SiliconANGLE. "AI went mainstream over the last two years and voice AI has gone mainstream over the last two to six months. There's a fundamental shift around the nature of how work is going to be done."

Stephenson said that having voice intractability fits into any place where a device has a microphone and a speaker such as websites, phones, mobile, AI pendants and even the drive-through. One example of where AI agents are already in use across the industry is in call centers, where agents can pick up the phone quickly so that there's little to no wait time for customers so that they can have questions answered or easy situations resolved.

"As we watch our children use their smartphones, it's obvious that voice-to-voice will become a standard method of human and machine interactions,"said Kevin Petrie, vice president of research at BARC US. "Deepgram's Voice Agent API addresses this market opportunity and makes customer service -- already a top use case for gen AI -- easier by converting text conversations to speech. Deepgram also broadens the market opportunity by integrating with a wide array of large language models."

This year saw the launch of several LLMs that can deliver natural voice conversation capabilities. The biggest examples include OpenAI's GPT-4o, Gemini Live from Google LLC and Tenyx Voice from Tenyx Inc.

Previous articleNext article

POPULAR CATEGORY

corporate

2851

tech

3144

entertainment

3422

research

1436

misc

3640

wellness

2681

athletics

3554