Google has just introduced a new feature, Gemini Live, which is set to compete with OpenAI’s Advanced Voice Mode for ChatGPT. Revealed during the 2024 Made by Google event, Gemini Live is now available to users who have the advanced Gemini package.
This feature is all about making conversations with AI feel more natural and fluid. With Gemini Live, users can jump between topics, cut off, or continue conversations just like they would with a real person.
It comes with a new speech engine offering ten realistic voices and even has the ability to mimic your speech in real time. The best part? You can keep the conversation going even when your phone is locked or while you’re multitasking.
Powered by Gemini 1.5 Pro and Gemini 1.5 Flash models, this feature allows for longer and more coherent conversations, thanks to its improved context window. Plus, later this year, it will support multimodal input, meaning you’ll be able to interact using visual prompts like images and videos.
For now, Gemini Live is only available in English on Android devices, but more languages and iOS support are on the way. Soon, you’ll also be able to use Gemini with other Google apps like Calendar, Keep, and YouTube Music, managing your tasks with just your voice.
On the flip side, OpenAI’s Advanced Voice Mode for ChatGPT has faced some criticism. Some users worry it could lead to over-dependence on AI, potentially affecting real-world relationships.
In response, OpenAI is working on improving its AI models to ensure they’re not just powerful but also safe and practical for everyday use.
Gemini Live looks like a game-changer for AI conversations, making interactions feel more natural and flexible. Its real-time speech mimicry and multitasking support could make it a strong competitor to OpenAI’s offerings.
Also Read: Google Launches AI-Powered Video Creation Tool “Google Vids”