Just a couple of hours prior to the start of Google I/O 2024 (7 p.m. ET), Google demonstrated the operation of a prototype of the new Gemini, which seems to use the video, direct, and voice messages. This demonstration represents a step forward from a chatbot-and a traditional one which, until now, have largely focused on the text messages and images.
Gemini knows how to interact with the live video
In a demonstration of the new-created, perhaps in preparation for The Google I/O, chatbot Gemini appears to act on a smartphone Pixel. In the video, Gemini uses it as a live video as well as voice messages to respond to the questions raised.
When asked, “What do you think is going on here?”, chatbot-analyses for the exact video that shows a scene that is lifted up, realizing that it's preparing for a big event. The conversation goes naturally with the Gemini, that was the answer to the questions to be assessed by identifying the characters on the screen which is referred to in The Google I/O, and provide a brief description of the event.
Demoja emphasizes on the skills of a Gemini on the combination of information by the methods of different video, audio and text to understand the context and to give the appropriate responses. In comparison with a chatbot-and the former, therefore, appears to have made significant progress in the integration of the input multimodale.
One more day until #GoogleIO! We're feeling 🤩. See you tomorrow for the latest news about IT, Search, and more. pic.twitter.com/QiS1G8GBf9
— Google (@Google) May 13, 2024
Google's challenge OpenAI
Demoja, in general, is quite impressive, not only for the use of multi-modal voice, and video applications, but also for the natyrshmërinë with which to develop the conversation. However, it is important to note that Google first showed me a demonstration of a very similar at the level of the message to be Gemini, which turned out to be a little bit too good to be true.
It is not clear whether the same holds true for this demo of the new, but the user interface displayed on the display clearly indicates that you are using the video, and Google says that it is a “prototype”.
The timing of the release of this ngacmuesi it is not a coincidence: the video was uploaded to X less than an hour before the event OpenAI, which ChatGPT, the GPT-4o, and he came with the same functionality to be shown by Google, and all of them free of charge. This shows how Google is trying to maintain the position of its leaders in the field of artificial intelligence, predicting the movements of the competitors.
Discussion about this post