Google unveils Veo 2 Video AI Generator to rival OpenAI's Sora

17 Dec 2024 8:28 PM IST

Google has launched Veo 2, its latest AI video generator, which aims to compete with OpenAI's Sora. Veo 2 is an upgrade from the original Veo model and promises realistic motion and high-quality output at resolutions up to 4K. Google claims it surpasses other leading AI video generation models.

Key Highlights:

Advanced AI Model: Veo 2 outperforms existing models such as Meta Movie Gen and Sora Turbo, according to human evaluations cited by Google.

Not Yet Available in India: The Veo 2 video generator is currently not accessible in the Indian market.

Google Introduces Veo 2, Imagen 3, and Whisk AI Models

Google showcased Veo 2's capabilities through short clips, each lasting 8 seconds, that show hyper-realistic animals, food, and animated humans. These clips demonstrate the model's ability to generate visually compelling content.

"Veo 2 outperforms other leading video generation models based on human evaluations of its performance," stated Google, implicitly referring to competitors like OpenAI's Sora. The company's benchmark graph indicates that Veo 2 is preferred over Meta Movie Gen, Kling V1.5, Minimax, and Sora Turbo.

While Veo 2 has shown remarkable improvements, Google acknowledges some challenges. Certain scenes with complex motions still exhibit minor inaccuracies, and details can be missing in parts of a frame. Google DeepMind commented, "While Veo 2 demonstrates incredible progress, creating realistic, dynamic, or intricate videos and maintaining complete consistency throughout complex scenes or those with complex motion remains a challenge. We’ll continue to develop and refine performance in these areas."

Enhanced Image Generation with Imagen 3

The new Imagen 3 model from Google can produce brighter and more realistic images with vibrant hues, better color balance, and high fidelity. It is capable of generating highly detailed textures and appealing visuals, offering styles ranging from photorealism to impressionism, abstracts, and anime.

Innovative Image Creation with Whisk

Whisk, a new experimental AI model from Google Labs, allows users to create images by prompting with other images instead of words. Users can upload multiple photos to categories such as Subject, Scene, and Style to generate new composite images. For example, by combining a personal photo (Subject), a mountain view (Scene), and an animated style (Style), Whisk can create a unique image.

Additionally, the Gemini model automatically writes detailed captions for the uploaded images, and these descriptions are then fed into Imagen 3. This lets users easily remix subjects, scenes, and styles in creative ways.

Availability

Currently, these advanced tools are available to users in the US, but Google plans to introduce them to the Indian market soon.
