Newzvia

Technology | Google DeepMind's Gemini Pro X: More Hype or Real Leap?

Pankaj Mukherjee, Senior Technology Correspondent · AI, startups & MeitY policy

3 min read

Quick summary

Google DeepMind just announced Gemini Pro X, an advanced AI model that promises better understanding across text, images, and video. This new version aims for smarter reasoning, but details for Indian developers remain unclear.

Another week, another AI model. But this one from Google DeepMind, called Gemini Pro X, might actually mean something for how computers understand the world.

The company has announced the launch. It says this advanced AI model makes big jumps in multimodal understanding, meaning it can process and make sense of different types of information at the same time. Think text, images, and videos, all together.

This isn't just about reading an email. It's about looking at a video, listening to the audio, and reading any text on screen, then figuring out what's happening. Google DeepMind also highlights 'complex reasoning'. This means the AI can solve harder, trickier problems that need more than simple recall.

Beyond Text and Images

Until now, many advanced AI models (computer programs that can learn and make decisions) were good with either text or images. Combining them well has been a challenge. Adding video into that mix, and expecting the AI to 'reason' or think through a problem across all three, is a big step. Imagine an AI watching a cooking video, reading the recipe on screen, and understanding the spoken instructions. That's the promise.

This push for smarter AI isn't happening in a vacuum. Microsoft recently rolled out 'Co-Creator' features in Teams. These generative AI tools — software that creates new content — help with meeting summaries and brainstorming. It shows companies want AI to do more real-world work.

The India Question

How does Gemini Pro X impact India? That's the crucial part. Google DeepMind hasn't shared specific details about its availability or pricing for Indian developers. Will it understand the nuances of Indian languages, regional videos, or local imagery effectively? India has a massive developer community and a growing hunger for AI tools. Easy access and affordable pricing will be key.

Meanwhile, the ethical side of AI is also gaining ground. Veritas AI, a startup focusing on 'trustworthy AI', just secured $100 million in funding. This shows that as AI gets more powerful, making sure it's fair and secure is becoming a top priority for investors too.

What We Don't Know Yet

While the announcement sounds impressive, Google DeepMind shared few specific details. We don't have benchmarks showing how much better Gemini Pro X is compared to other top models. There are no clear timelines for when developers can widely access it, or what it will cost. Is this a genuine leap, or a refinement with marketing flair? We need more than just headlines to know for sure.

The race to build the smartest AI continues. What really matters is how these powerful tools become useful, ethical, and accessible to everyone, including users and innovators in India.

Key Takeaways

  • Google DeepMind launched Gemini Pro X, a new AI model.
  • It promises better understanding of text, images, and video together.
  • Specifics on availability, pricing, or India impact are still unclear.
  • Ethical AI development is also a growing focus for investors.

Quick questions

What is multimodal understanding in AI?
AI processes and links various data types—text, images, video—simultaneously.
How does Gemini Pro X differ from older AI?
Gemini Pro X, announced in 2026, aims for more complex reasoning across text, images, and video than earlier models, though Google DeepMind has not yet shared benchmarks to back this up.
Is it available in India?
Still unclear: Google DeepMind hasn't shared specific India availability or pricing details.
So what now?

We're expecting more technical and access details soon. The model's true value will lie in practical, everyday applications, including those built by Indian developers.
