Google’s Gemini AI demo is too good to be true (literally)

(Image credit: Laptop Mag / Rael Hornby)

This week, Google’s December Feature Drop for Pixel devices was accompanied by a wealth of content showcasing the brand’s latest multimodal AI, Gemini. In a series of videos highlighting the software’s key and unique features, the brightest minds from Google HQ wowed us with a series of bold claims and presentations.

Enter a slow fade-in from black mixed with “generic-slow-build-piano-music.mp3” as Google CEOs Sundar Pichai and Demis Hassabis talk about their drives and passion for the Gemini project like it was an American Idol audition vignette for the brainiest boyband imaginable.

The dawn of a Gemini era

What followed was a series of talking head moments with the Googleplex’s top brass hyping up the moment as if they’d just figured out how to hardcode the return of the Messiah. Unlike most AI models, Gemini is capable of deciphering a mix of text, code, and media all at once to understand the wider context of what it is tasked with achieving.

For example, in one demonstration Gemini was pre-prompted by text to act as a kitchen assistant, and then a voice recording was added to the prompt where a user asked for instructions on how to begin making a veggie omelet using the available ingredients. Finally, a picture of the ingredients was included before Gemini was asked to generate a response.

In the video above, you'll be able to see that Gemini’s reply adopted the role of the kitchen assistant, observed the available ingredients, and then delivered the first step in veggie omelet preparation in the form of a voice note. Impressive stuff, to be sure. Especially when the user was able to show Gemini an updated picture of their omelet in process and ask Gemini how it was going, with the AI picking up that the dish was ready to be cooked on the other side.

That's AI-volution, baby

Of course, Gemini’s capabilities scale far and beyond the simple frying of an egg. To showcase Gemini’s potential in full, Google had prepared a near-six-minute “hands-on” demonstration with the AI.

Within those six minutes, Gemini was able to play a round of the shell game, solve a dot-to-dot picture, and react in real-time (sometimes unprompted) to a drawing as it was created. All of this while engaging in back-and-forth audio conversation.

Presenting a fluid conversation between user and AI, some advanced image recognition techniques, and a fair amount of personality, the demo showcased the AI of our dreams — quite an apt statement given that the interactions being shown were mostly fiction.

The Gemini lie

Following the release of the “hands-on” video, Bloomberg Opinion columnist Parmy Olsen was quick to point out the small print included in the video description, reading: “For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.”

However, according to Bloomberg, when asked for comment Google expanded on this disclaimer further — revealing that the demo didn’t happen in real-time, nor did it include spoken prompts.

In fact, a Google spokesperson revealed that the video was made by using “still image frames from the footage, and prompting via text.” In essence, the very same interaction as was shown in the previous veggie omelet example.

While ultimately still impressive, the edited video does somewhat misrepresent the experience of using Gemini — painting it as a far more capable tool than it currently is.

Outlook

It’s a shame Google attempted to hot dog and grandstand about the Gemini experience in this way, as it casts a needless shadow of doubt over its new AI model. Google was caught flatfoot by the explosion of AI over the last year and has been burning the midnight oil to play catch up to competitors like OpenAI and Microsoft. Misleading users isn’t going to help that situation.

But how egregious was this video in reality? I took to Google Bard, which Google recommends to visit if you want to take the Gemini model for an early test run, to give some of the examples seen in the “hands-on” video a go for myself. The results were… Well, not promising.

Google Bard image recognition fail, with Gemini — Bard's assumed Gemini-enhanced capabilities still leave a lot to be desired, with the AI unable to tell left and right from top and bottom, and failing to notice that one bicycle has square wheels while inventing other differences between the two entirely. (Image credit: Laptop Mag / Rael Hornby)

While slightly frustrated that Google reduced itself to Ubisoft Store levels of ‘bullshotting’ to make its new multimodal AI more captivating, I’m still impressed with what Google had to show — even if a lot of what Gemini is pictured to be is what Bard is currently touted as being capable of now (though, admittedly, failing to achieve).

Google still have some way to go before catching up with the pack. And while I feel that as a company they're more than capable of doing so, I don't think misleading videos will help in any way.

Back to Apple MacBook Pro

Acer

Apple

Asus

Lenovo

Microsoft

AMD Ryzen 5

AMD Ryzen 7

Intel Core i5

Intel Core i7

Intel Core i9

4GB RAM

8GB RAM

16GB RAM

24GB RAM

32GB RAM

64GB

128GB

256GB

512GB

1TB

2TB

4TB

13.3-inch

13.5-inch

13.6-inch

14-inch

Black

Blue

Silver

New

Refurbished

EMMC

SSD

Showing 10 of 150 deals

Filters☰

Apple MacBook Pro 16-inch M3 (2023)

(512GB Silver)

Our Review

☆☆☆☆☆

$2,899

$2,818.14

View

Apple MacBook Pro 14-inch M3 (2023)

(1TB SSD)

Our Review

☆☆☆☆☆

$1,799

$1,643.77

View

Apple MacBook Air M2 2022

(13.6-inch 256GB)

Lenovo IdeaPad Duet 5 Chromebook

(13.3-inch 128GB)

Our Review

☆☆☆☆☆

$499

$399.99

View

Asus ROG Strix Scar 18

(2TB 32GB RAM)

Our Review

☆☆☆☆☆

$4,995

View

Lenovo ThinkPad X1 Yoga (Gen 7)

(14-inch 256GB)

$1,999.99

View

Apple MacBook Pro 14-inch (2023)

(14-inch 512GB)

Our Review

☆☆☆☆☆

$1,458

View

Microsoft Surface Laptop Studio

(256GB Intel Core i5)

Our Review

☆☆☆☆☆

$1,119

View

Microsoft Surface Laptop 4 13.5"

(13.5-inch 256GB)

$619

View

Rael Hornby, potentially influenced by far too many LucasArts titles at an early age, once thought he’d grow up to be a mighty pirate. However, after several interventions with close friends and family members, you’re now much more likely to see his name attached to the bylines of tech articles. While not maintaining a double life as an aspiring writer by day and indie game dev by night, you’ll find him sat in a corner somewhere muttering to himself about microtransactions or hunting down promising indie games on Twitter.