VoxSigma language additions: Russian and Chinese for telephone speech, Romanian for broadcast speech (2014)

Orsay - November 3, 2014

Vocapia Research continues to expand its language portfolio, and has speech-to-text systems covering most European languages. Recently updated conversational telephone speech-to-text transcription systems for the Russian and Chinese languages are now available on the VoxSigma Web service. A system for transcription of broadcast audio in Romanian is also now available along with new versions for Italian and Russian, and a beta version of a Brazilian Portuguese system. All are available as standalone software and via Vocapia's web service.

Vocapia Research is also pleased to announce extended versions of adaptation functionalities. On-the-fly topic adaptation makes use of accompanying meta-data (such as texts) when processing the audio document. The data serves to increase the lexical coverage of the speech-to-text system and to adapt the language model to the specific domain of the audio document with the aim of improving the transcription accuracy. Originally introduced for the French language, on-the-fly adaptation is now also available for the Dutch, English, German and Italian languages, with more languages to follow.

In addition, daily adaptation based on current news events helps ensure that the system vocabulary stays up-to-date, thereby providing better coverage of hot news items. This daily update improves the transcription accuracy on popular topics, particularly on named entities, that is person, company/organization and place names, that have peaky popularity in news data. Daily updated models are currently available for broadcast data transcription in French and Italian.

Vocapia continually works on improving the accuracy and capabilities of its technologies (speech-to-text, speech-text alignment, keyword spotting, speaker diarization and language identification) with other releases planned before the end of 2014.

About Vocapia Research

Vocapia Research, founded in July 2000, is an R&D company and software publisher developing and providing leading edge speech technologies and solutions for many languages, including most major European Union languages as well as Arabic, Mandarin, and Russian. The Vocapia Research VoxSigma^® software suite uses advanced language technologies such as language identification, speech recognition, and speaker identification to transform raw audio and audiovisual data into structured and searchable XML documents. This technology relies on over 25 years of research at LIMSI-CNRS, with which there is a priviledged partnership. Joint systems developed with LIMSI have achieved top ranks in national and international challenges of speech-to-text transcription. The most common applications of the VoxSigma software suite are audio and audiovisual data mining (broadcast data, podcasts, call center data), media monitoring, and media asset management. Vocapia Research is located in the scientific pole of the Saclay Plateau, France. Readers who wish to get more information about Vocapia Research are invited to check out the Vocapia Research website or use the contact information page http://www.vocapia.com/contact.