KUDO AI Engine v.2.0 for Improved Speech Translation Accuracy
KUDO AI (patent pending) represents a speech-to-speech translation solution that offers multilingual audio and captioning. This sophisticated technology empowers users to listen to speakers in their preferred language, eliminating the need to rely solely on subtitles. A standout feature of the KUDO AI Speech Translator is its capacity to operate in real-time, supporting near-simultaneous and uninterrupted translation, thus promising a frictionless user experience. This feature has been specifically designed and optimized for live translation of speeches, lectures, and presentations among others.
Recommended: Embrace AI to become a W.I.T.C.H. Leader
Underpinning this capability is our simultaneity module, based on advanced machine learning and NLP techniques, which processes and analyzes the structure of the speech as it unfolds, making informed decisions with each new word spoken. “This task is intricate due to the necessity of maintaining a delicate balance: creating short and compact segments of text to facilitate real-time translation, while also considering that machine translation often delivers better results with longer texts and more context”, says Claudio Fantinuoli, CTO and designer of KUDO AI. Achieving the right balance is crucial to enhancing translation accuracy and elevating the overall user experience.
The quality is measured by letting linguists manually evaluate a corpus of texts processed by the simultaneity module over a balanced corpus of speeches in each language. The corpus represents prototypical inputs for which the application has been designed and comprises several categories of texts, such as technical presentations, political speeches, lectures, but also more challenging ones, such as casual talks, speeches rich at disfluencies, non native speakers, etc. This evaluation is performed at each iteration and is a measure used internally to assess improvements of the engine.
Isabel Canovas, Linguistic Analyst, adds “A roughly 30% reduction of errors in the machine learning-based simultaneity module corresponds to a similar uplift in the overall translation experience”. In this scenario, it’s not just the accuracy but the precision with which the original message is conveyed in translation that sees a positive impact, but also and foremost the grammatical and syntactical naturalness of the translated output.
Recommended: AiThority Interview with Arijit Sengupta, CEO and Founder at Aible
This update to the AI Engine reflects the ongoing evolutions planned by KUDO to advance product sophistication and use cases for their Speech Translator. Currently, KUDO AI is designed for one-directional meetings and events where presenters are speaking one or multiple languages to an audience; All Hands meetings, L&D, webinars, global events, etc. Initial adoption of the solution has shown that the need for language access in these use cases remains high.
Top AI ML Insights: AiThority Interview with Alex Mans, Founder and CEO at FLYR Lab
[To share your insights with us, please write to sghosh@martechseries.com]
Comments are closed.