Newzvia

Technology | Google DeepMind's Gemini Pro X: More Hype or Real Leap?

Pankaj Mukherjee, Senior Technology Correspondent

Pankaj Mukherjee

Senior Technology Correspondent · AI, startups & MeitY policy

3 min read

Quick summary

Google DeepMind just announced Gemini Pro X, an advanced AI model that promises better understanding across text, images, and video. This new version aims for smarter reasoning, but details for Indian developers remain unclear.

Another week, another AI model. But this one from Google DeepMind, called Gemini Pro X, might actually mean something for how computers understand the world.

The company announced its launch on . They say this advanced AI model makes big jumps in multimodal understanding — meaning it can process and make sense of different types of information at the same time. Think text, images, and videos, all together.

This isn't just about reading an email. It's about looking at a video, listening to the audio, and reading any text on screen, then figuring out what's happening. Google DeepMind also highlights 'complex reasoning'. This means the AI can solve harder, trickier problems that need more than simple recall.

Beyond Text and Images

Until now, many advanced AI models (computer programs that can learn and make decisions) were good with either text or images. Combining them well has been a challenge. Adding video into that mix, and expecting the AI to 'reason' or think through a problem across all three, is a big step. Imagine an AI watching a cooking video, reading the recipe on screen, and understanding the spoken instructions. That's the promise.

This push for smarter AI isn't happening in a vacuum. Microsoft recently rolled out 'Co-Creator' features in Teams. These generative AI tools — software that creates new content — help with meeting summaries and brainstorming. It shows companies want AI to do more real-world work.

The India Question

How does Gemini Pro X impact India? That's the crucial part. Google DeepMind hasn't shared specific details about its availability or pricing for Indian developers. Will it understand the nuances of Indian languages, regional videos, or local imagery effectively? India has a massive developer community and a growing hunger for AI tools. Easy access and affordable pricing in will be key.

Meanwhile, the ethical side of AI is also gaining ground. Veritas AI, a startup focusing on 'trustworthy AI', just secured $100 million in funding. This shows that as AI gets more powerful, making sure it's fair and secure is becoming a top priority for investors too.

What We Don't Know Yet

While the announcement sounds impressive, Google DeepMind shared few specific details. We don't have benchmarks showing how much better Gemini Pro X is compared to other top models. There are no clear timelines for when developers can widely access it, or what it will cost. Is this a genuine leap, or a refinement with marketing flair? We need more than just headlines to know for sure.

The race to build the smartest AI continues. What really matters is how these powerful tools become useful, ethical, and accessible to everyone, including users and innovators in India.

Key Takeaways

  • Google DeepMind launched Gemini Pro X, a new AI model.
  • It promises better understanding of text, images, and video together.
  • Specifics on availability, pricing, or India impact are still unclear.
  • Ethical AI development is also a growing focus for investors.

Quick questions

What is multimodal understanding in AI?
AI processes and links various data types—text, images, video—simultaneously.
How does Gemini Pro X differ from older AI?
2026's Gemini Pro X targets significantly enhanced, complex reasoning across diverse media, surpassing simpler, prior AI models.
Is it available in India?
Still unclear: Google DeepMind hasn't shared specific India availability or pricing details.
So what now?

We're expecting more technical and access details soon. Its true value lies in practical, everyday applications.

This includes Indian developers.

Newzvia·19 Jun 2026

OpenAI's Prism-v2: More Than Just Text

OpenAI unveiled Prism-v2, a new AI model designed to understand and create across text, images, and video for developers. This could bring new creative and analytical tools to Indian startups, but pricing details are still awaited.
Read article
Newzvia·16 Jun 2026

Apple's iOS 18.5: Security Fixes Over New Features

Apple just rolled out iOS 18.5, an update primarily focused on patching critical security gaps and offering minor tweaks to its Safari web browser. This essential maintenance keeps iPhones safe for users, including the many here in India.
Read article
Newzvia·14 Jun 2026

OpenAI's GPT-5: AI Now Understands Text, Images, And Sound Together

OpenAI has significantly updated its GPT-5 AI model, allowing it to seamlessly understand and generate content across text, images, and audio. This advancement promises new tools for developers and businesses, including those in India, in the coming weeks.
Read article
Newzvia·11 Jun 2026

Google DeepMind's Gemini Ultra 2.0 AI Model Arrives

Google DeepMind today launched Gemini Ultra 2.0, its most advanced large language model, promising better understanding across text, images, and video. While specific India plans are not yet clear, this update could bring more powerful AI tools to users and developers here.
Read article
Newzvia·8 Jun 2026

Google DeepMind's AlphaFold 4: Mapping Life's Blueprints

Google DeepMind has unveiled AlphaFold 4, an advanced AI model that can predict protein shapes with unmatched accuracy. This breakthrough could dramatically speed up research in medicines and new materials, with potential benefits for India's scientific community.
Read article
Newzvia·4 Jun 2026

OpenAI's GPT-5 Turbo: What it means for developers

OpenAI today launched its GPT-5 Turbo model, promising better AI understanding across text, images, and audio. This update aims to give developers new tools for building smarter applications, with potential impact for India's tech scene.
Read article

More from categories

Business

View all

Technology

View all

Sports

View all