Google's Gemini 2.0

Mazdak · December 11, 2024, 3:41pm

Unlike its predecessor, Gemini 1.5 Flash, which could only generate text, Gemini 2.0 Flash is multimodal on the output side too. It can:

Generate images and audio, natively.

Call tools and services such as Google Search.

Analyze images, videos, and audio.

And developers? You’re getting access to an experimental release starting today.

Why should you care?

Google says Gemini 2.0 is:

Twice as fast as Gemini 1.5 Pro, while outperforming it on key benchmarks.

Significantly better at coding and math than its predecessor.

Able to narrate text aloud in customizable voices.

Imagine creating apps with real-time multimodal functionality—or having AI that “thinks” like an agent, as Google puts it.

Plus, all generated images and audio are watermarked with SynthID to address deepfake concerns.