GPT-4o Rival: Kyutai Labs Launches Moshi AI Chatbot With Real-Time Voice Features; Check Details

Kyutai Labs has launched Moshi AI, a real-time voice-interactive chatbot, likely to be a free alternative to OpenAI's GPT-4o. Moshi AI chatbot features the ability to express emotions, respond in various speaking styles and handle simultaneous audio streams.

Moshi AI Chatbot (Photo Credits: X/@kyutai_labs)

New Delhi, July 7: Kyutai Labs, a French AI firm, has launched Moshi AI, an artificial intelligence (AI) chatbot that responds verbally in real-time. Moshi AI chatbot by Kyutai Labs brings a new approach to AI-driven conversations with its real-time voice capabilities. As per multiple reports, AI chatbot Moshi is touted as a rival to OpenAI's GPT-4o, which also recently announced similar speech features. The innovative AI chatbot is likely to offer its users a more interactive experience with the added benefit of voice interactions.

According to a report of Gadgets360, Moshi AI chatbot With Real-Time Voice Features Launched by Kyutai Labs as GPT-4o Rival. The company states that the AI model was developed in-house and can modulate the voice to express emotions and respond in various speaking styles. OpenAI has revealed plans to offer similar speech features with the release of GPT-4o, however, it is yet to be released. OpenAI Sora-Rival Gen-3 Alpha Text-to-Video Generator by Runway AI Now Available for Everyone; Check Details.

Kyutai Labs Introduces Moshi AI Chatbot

Kyutai Labs Moshi AI Chatbot Unveiled: Watch Live Demo

Moshi AI Chatbot Features

Kyutai Labs has released the Moshi AI chatbot for public use and it is available for free. The platform has a simple interface. Users can monitor the loudness of their voice when they speak. The text box shows only the AI's responses, while another box at the top displays technical details like audio duration, latency and missed audio. The AI model can think, speak and listen simultaneously to keep the conversation smooth. It can also connect to the internet to look up information for queries. The Moshi AI chatbot operates solely through voice interactions and does not support text prompts. Ola Cabs CEO Bhavish Aggarwal Launches Ola Maps As In-House Alternative to Google Maps.

The AI model limits chats to 5 minutes. Built on a 7B parameter large language model (LLM) named Helium, the chatbot can be used by everyone and can speak in different accents and 70 unique emotional and speaking styles, according to a report from Indian Express. The AI chatbot responds in 200 milliseconds. Moshi can also manage two audio streams at once, meaning it can listen and talk at the same time.

(The above story first appeared on LatestLY on Jul 07, 2024 07:22 PM IST. For more news and updates on politics, world, sports, entertainment and lifestyle, log on to our website latestly.com).

Share Now

Share Now