Deepgram and IBM today announced a collaboration to integrate Deepgram’s industry-leading speech-to-text and text-to-speech capabilities into IBM’s watsonx Orchestrate generative AI solution. Deepgram becomes IBM’s first voice partner, providing fast, reliable, and scalable voice AI technology that enables enterprises to build voice-enabled agents and workflows. The integration addresses enterprise needs for high-performance transcription, real-time captioning, and natural-sounding speech in noisy, real-world conditions, supporting diverse languages, dialects—including dozens of Arabic and Indian variants—and regional accents.
“Voice is rapidly becoming the default interface between humans and technology, and enterprise deployments require a real-time platform that is accurate, low latency, and reliable at scale,” said Scott Stephenson, CEO and Co-Founder, Deepgram. “By embedding Deepgram inside watsonx Orchestrate Agent Builder, IBM clients can build voice agents and voice-enabled workflows on top of a real-time foundation that has been developed and refined over more than a decade.”
“Our watsonx Orchestrate integration powered by Deepgram APIs introduces new speech recognition and transcription capabilities to IBM clients, refining and modernizing their operations,” said Nick Holda, Vice President of AI Technology Partnerships at IBM. “This collaboration aims to help enterprise organizations accelerate their AI initiatives and reinforces IBM’s open ecosystem, bringing choice and cutting-edge voice technology to partners and customers.”
The partnership combines Deepgram’s real-time voice platform—proven across over 50,000 years of audio and 1 trillion words—with watsonx Orchestrate Agent Builder. This enables IBM clients to deploy voice agents and conversational workflows that meet strict regulatory and compliance standards while handling nuanced, high-stakes interactions. Enterprises gain flexible, scalable voice AI that enhances automation, improves user experiences, and modernizes operations without sacrificing accuracy or governance.
About Deepgram
Deepgram is the real-time API platform underpinning the Voice AI economy. Its Voice AI platform offers speech-to-text (STT), text-to-speech (TTS), and full speech-to-speech (STS) capabilities–all powered by its enterprise-grade runtime. 200,000+ developers build with Deepgram’s voice-native foundational models – accessed through cloud APIs or as self-hosted / on-premises APIs – due to its unmatched accuracy, low latency, and pricing. Customers include technology ISVs building voice products or platforms, co-sell partners working with large enterprises, and enterprises solving internal use cases. Having processed over 50,000 years of audio and transcribed over 1 trillion words, there is no organization in the world that understands voice better than Deepgram.
About IBM
IBM is a leading provider of global hybrid cloud and AI, and consulting expertise. We help clients in more than 175 countries capitalize on insights from their data, streamline business processes, reduce costs and gain the competitive edge in their industries. Thousands of governments and corporate entities in critical infrastructure areas such as financial services, telecommunications and healthcare rely on IBM's hybrid cloud platform and Red Hat OpenShift to affect their digital transformations quickly, efficiently and securely. IBM's breakthrough innovations in AI, quantum computing, industry-specific cloud solutions and consulting deliver open and flexible options to our clients. All of this is backed by IBM's long-standing commitment to trust, transparency, responsibility, inclusivity and service.