Vocapia Research continues to expand its language portfolio and has
speech-to-text systems covering most European languages. Recently updated
conversational telephone speech-to-text transcription systems for the Russian
and Chinese languages are now available on the VoxSigma Web service. A system
for transcription of broadcast audio in Romanian is
also now available along with new versions for Italian and Russian, and a beta
version of a Brazilian Portuguese system. All are available as standalone
software and via Vocapia's web service.
Vocapia Research is also pleased to announce extended versions of adaptation
functionalities. On-the-fly topic adaptation makes use of accompanying
meta-data (such as texts) when processing the audio document. The data serves
to increase the lexical coverage of the speech-to-text system and to adapt the
language model to the specific domain of the audio document with the aim of
improving the transcription accuracy. Originally introduced for the French
language, on-the-fly adaptation is now also available for the Dutch, English,
German and Italian languages, with more languages to follow.
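The general idea behind this kind of adaptation can be illustrated with a small sketch. The Python code below is not Vocapia's implementation and does not use the VoxSigma API; it is a simplified illustration under stated assumptions. The function names (`tokenize`, `adapt_vocabulary`, `interpolate_unigrams`), the unigram model and the interpolation weight are all hypothetical choices made for the example. It shows how words found in accompanying text can extend a vocabulary, and how word statistics from that text can be blended into a base language model.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (letters only)."""
    return re.findall(r"[^\W\d_]+(?:['-][^\W\d_]+)*", text.lower())


def adapt_vocabulary(base_vocab, metadata_text, min_count=1):
    """Return (extended vocabulary, newly added words).

    Words from the accompanying text that are missing from the base
    vocabulary and occur at least min_count times are added.
    """
    counts = Counter(tokenize(metadata_text))
    new_words = {
        word for word, count in counts.items()
        if word not in base_vocab and count >= min_count
    }
    return set(base_vocab) | new_words, new_words


def interpolate_unigrams(base_probs, metadata_text, weight=0.2):
    """Blend base unigram probabilities with those estimated from the text.

    The weight is an illustrative value, not a recommended setting.
    """
    counts = Counter(tokenize(metadata_text))
    total = sum(counts.values())
    if total == 0:
        # No usable text: keep the base model unchanged.
        return dict(base_probs)
    words = set(base_probs) | set(counts)
    return {
        word: (1 - weight) * base_probs.get(word, 0.0)
        + weight * counts[word] / total
        for word in words
    }
```

For example, calling `adapt_vocabulary({"the", "minister", "said"}, "The minister Dupont said Dupont agrees")` adds "dupont" and "agrees" to the vocabulary. A production system would also need pronunciations for the new words and a far richer language model than unigrams.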
In addition, daily adaptation based on current news events helps ensure that the
system vocabulary stays up-to-date, thereby providing better coverage of hot
news items. This daily update improves the transcription accuracy on popular
topics, particularly on named entities (person, company/organization and
place names) whose popularity in news data peaks sharply. Daily updated models are
currently available for broadcast data transcription in French and Italian.
Vocapia continually works on improving the accuracy and capabilities of its
technologies (speech-to-text, speech-text alignment, keyword spotting, speaker
diarization and language identification) with other releases planned before
the end of 2014.
Vocapia Research, founded in July 2000, is an R&D company and
software publisher developing and providing leading-edge speech
technologies and solutions for many languages, including most major
European Union languages as well as Arabic, Mandarin, and Russian. The
Vocapia Research VoxSigma® software suite uses advanced
language technologies such as language identification, speech
recognition, and speaker identification to transform raw audio and
audiovisual data into structured and searchable XML documents. This
technology relies on over 25 years of research at LIMSI-CNRS, with
which Vocapia has a privileged partnership. Joint systems developed
with LIMSI have achieved top ranks in national and international
challenges of speech-to-text transcription. The most common
applications of the VoxSigma software suite are audio and audiovisual
data mining (broadcast data, podcasts, call center data), media
monitoring, and media asset management. Vocapia Research is located in
the scientific pole of the Saclay Plateau, France. Readers who wish to
get more information about Vocapia Research are invited to check out
the Vocapia Research website or use the contact information page at
http://www.vocapia.com/contact.