DeepL, a leader in Language AI, has officially launched DeepL Voice-to-Voice, a sophisticated product suite designed to facilitate real-time spoken communication across various business environments. This expansion into speech-to-speech translation aims to eliminate language barriers in virtual meetings, in-person interactions, and customer-facing touchpoints via API integration. By combining high-level voice models with its established translation AI, DeepL is positioning itself to provide instant, natural communication for global teams without the traditional reliance on human interpreters.
Real-Time Capabilities: DeepL Voice-to-Voice enables instant spoken translation for virtual platforms like Zoom and Microsoft Teams.
Multi-Platform Support: The suite includes mobile and web applications for in-person conversations and workshops accessible via QR codes.
Enterprise Integration: A Voice-to-Voice API allows businesses to embed real-time translation into internal tools and contact centers.
Technical Precision: New customization features allow for the accurate translation of industry-specific terminology and company-specific glossaries.
Language Expansion: The launch supports over 40 languages, including all 24 official EU languages and several Southeast Asian and Middle Eastern languages.
Performance Benchmarks: Independent evaluations show linguists preferred DeepL Voice over native solutions from Google and Microsoft by 96%.
The DeepL Voice product suite is structured to address specific organizational pain points. "Voice for Meetings" allows participants to speak their native language in digital environments while others receive the translation in theirs. For physical settings, "Voice for Conversations" and "Group Conversations" facilitate multilingual exchanges in workshops or training sessions. This ensures frontline workers and global teams maintain a shared understanding during hands-on interactions.
A significant component of this launch is the "Voice-to-Voice API," which offers businesses the flexibility to integrate these capabilities into custom applications. To ensure accuracy in technical environments, DeepL has introduced quality optimization tools. These tools ensure that fast or technical speech—including product names and specific terminology—is captured correctly by integrating user-defined translation glossaries.
Beyond voice capabilities, the company is evolving its core offering into the next-generation DeepL Translator platform. This move aims to centralize translation operations into an AI-first infrastructure, addressing the slow and manual coordination often found in traditional translation management. The new platform focuses on "Translation Flow," where content moves through existing systems automatically with the correct tone and terminology applied.
The platform also introduces "Translation Quality Assessment," providing teams with clear criteria to evaluate the reliability of a translation. This feedback loop, combined with direct editing capabilities, allows the AI to learn from corrections, ensuring that translations continuously improve and adapt to the specific nuances of a business over time.
Independent evaluations conducted by Slator revealed high quality scores for DeepL’s integrations. DeepL Voice for Zoom and Microsoft Teams achieved quality scores of 96.4/100 and 96.3/100, respectively. These figures highlight a significant performance gap between DeepL and native platform translation solutions.
Jarek Kutylowski, Founder & CEO of DeepL said: "Today, we reach another frontier in translation: real-time, spoken communication. Our mission has always been to break down language barriers and we've now overcome one of the biggest of all. DeepL Voice-to-Voice allows everyone to speak naturally in their own language without the friction or cost of interpreters. We're fusing world-class voice models with the gold-standard translation AI we've been pushing to new heights. Now, expertise is all that counts, not language."
Yoichi Okuyama, Head of DX System Department at Pioneer, noted: "Relying solely on English proficiency for global collaboration often slowed us down, as team members hesitated to contribute complex ideas. By implementing DeepL Voice, we've removed that friction and created a more inclusive environment where everyone can speak confidently in their native language."
Geoffrey Wright, Global Solution Owner - GenAI and Digital Experience at Mondelēz International, commented: "At Mondelēz, we don't settle for slow—on the road or in our workflows. Our old translation process was like driving on a flat tyre, but DeepL is full service at 100 mph. By embedding their Language AI, teams like M&A and Legal are handling sensitive documents with top speed and total confidentiality."
The transition toward real-time, AI-driven language infrastructure marks a shift from viewing translation as a technical necessity to viewing it as a strategic enabler for business speed and operational efficiency.
DeepL is a global AI company building the language infrastructure that powers global business. More than 200,000 business teams and millions of individuals use DeepL's Language AI platform to communicate globally, collaborate and operate across languages in real time. By combining breakthrough AI models with enterprise-grade security and privacy, DeepL enables organizations to work seamlessly across markets and cultures. Founded in 2017 by CEO Jarek Kutylowski, DeepL now has more than 1,000 employees and is backed by leading investors including Benchmark, IVP and Index Ventures