SeamlessM4T: Meta Introduces New All-in-One Multilingual AI Speech Translation Model for 100 Languages
San Francisco, August 22: Heating up the artificial intelligence (AI) race, Meta on Tuesday launched a new all-in-one, multilingual multimodal AI translation and transcription model for up to 100 languages depending on the task. Called ‘SeamlessM4T,’ the single model can perform speech-to-text, speech-to-speech, text-to-speech, and text-to-text translations.
'SeamlessM4T' supports speech recognition for nearly 100 languages; speech-to-text translation for nearly 100 input and output languages; speech-to-speech translation from nearly 100 input languages into 36 output languages (including English); and text-to-text translation for nearly 100 languages.
It also supports text-to-speech translation from nearly 100 input languages into 35 output languages (including English). “We’re also releasing the metadata of SeamlessAlign, the biggest open multimodal translation dataset to date, totalling 270,000 hours of mined speech and text alignments,” Meta said in a blog post.
Last year, Meta released No Language Left Behind (NLLB), a text-to-text machine translation model that supports 200 languages and has since been integrated into Wikipedia as one of its translation providers.
“We also shared a demo of our Universal Speech Translator, which was the first direct speech-to-speech translation system for Hokkien, a language without a widely used writing system,” said the company.
Earlier this year, Meta revealed Massively Multilingual Speech, which provides speech recognition, language identification and speech synthesis technology across more than 1,100 languages.
'SeamlessM4T' draws on findings from all of these projects to enable a multilingual and multimodal translation experience stemming from a single model, built across a wide range of spoken data sources with state-of-the-art results, Meta noted.
(The above story first appeared on LatestLY on Aug 22, 2023 10:02 PM IST.)