Newzvia

Artificial Intelligence | Google's Gemini Pro 1.5: Smarter AI for Businesses, Not Yet for All

Pankaj Mukherjee, Senior Technology Correspondent

Pankaj Mukherjee

Senior Technology Correspondent · AI, startups & MeitY policy

2 min read

Quick summary

Google DeepMind today launched Gemini Pro 1.5, an AI model that now understands text, images, sound, and video much better. It mainly targets large companies, raising questions about its accessibility and relevance for Indian startups and developers.

Google DeepMind has a new AI model out today, named Gemini Pro 1.5. They say it’s much smarter with different kinds of information. Think pictures, sound, and even videos. This model aims mostly at big businesses, per the official release on .

What It Does Differently

This new version, , can reason across different types of data. It understands text, images, audio, and video inputs. This is called “multimodal understanding.” It means the AI can look at a video, listen to its sound, and read any captions. Then it can make sense of all this information together.

The company claims it does this with greater accuracy and context awareness. This helps it tackle complex tasks for companies. For example, it could analyze security footage or customer service calls.

The India Question

Google DeepMind says is for enterprise use. This usually means it's for larger companies. But what does this mean for India? Our startups and smaller businesses often drive innovation. Will they get easy access?

The announcement didn't share specific details on pricing or regional availability for India. We don't know if it will cost ₹100 or ₹10,000 to use this advanced model. Support for Indian languages is also not confirmed. This matters for local developers building solutions here.

What Wasn't Said

The news focused on “significantly improved” understanding. But Google DeepMind didn't release detailed benchmarks. These help us compare its real performance against other models.

OpenAI recently rolled out its GPT-5 API, claiming “unprecedented advancements” in reasoning. Anthropic also expanded access to Claude 3 Opus for scientific research. All these companies make big claims. Without public data, it's hard to verify how much better truly is. We also don't know when regular developers can get their hands on it.

The race to build smarter AI continues. Each new model promises more. Companies need to see if these promises deliver real value. Especially in diverse markets like India. We'll be watching to see how performs in actual business settings.

Key Takeaways

  • Google DeepMind launched Gemini Pro 1.5 with enhanced multimodal understanding.
  • The AI can process text, images, audio, and video inputs together.
  • It's designed for enterprise applications, leaving questions about access for smaller Indian businesses.
  • Specific performance data and pricing for India are still unknown.

People also ask

What is multimodal AI?
AI that processes and understands varied data inputs: text, images, sound, and video.
2026-05-12 saw Google DeepMind announce this model. What does "enterprise applications" mean?
2026's announcement means the AI serves large organizations for complex business problems, examples include analyzing vast customer data or security footage.
For general public?
No — the announcement targets enterprise applications. General public access is currently unconfirmed.
So, for India?
Specific pricing and availability for India were not mentioned in the official release; these remain unconfirmed.
Newzvia·2 Jun 2026

Gemini 2.0 Arrives: What Google Claims, What's Missing

Google DeepMind today launched Gemini 2.0, its latest AI model with big promises for better reasoning and code. But specific details for Indian users and developers remain unsaid.
Read article
Newzvia·30 May 2026

Google's Gemini Apex: New AI Model, Old Questions

Google DeepMind today launched Gemini Apex, an advanced large language model that understands video, audio, and text in real-time. But critical details like pricing for India and training data transparency remain unclear.
Read article
Newzvia·27 May 2026

Google's Gemini 2.5 Pro: More Capable, Still Vague

Google has launched Gemini 2.5 Pro, an upgraded AI model that better understands text, images, and video, alongside a much larger 'memory.' Indian developers might find new uses, but key details like local language support and pricing remain unconfirmed.
Read article
Newzvia·24 May 2026

Nebula-7: New Open-Source AI Model Promises Global Research Boost

The AI Open Research Consortium just released 'Nebula-7', a new open-source AI model that can understand different kinds of information. This move could help Indian developers and researchers innovate more easily.
Read article
Newzvia·22 May 2026

EU Countries Act to Enforce World's First AI Law

Key European Union nations, including Germany and France, are setting up special bodies to enforce the new EU AI Act. This move means Europe is serious about making its AI rules a reality, prompting questions for India.
Read article
Newzvia·19 May 2026

Anthropic's Claude 4.5: Better Reasoning, Less Hallucination?

Anthropic has launched Claude 4.5, its new AI model, claiming it understands text, images, and audio better, and makes fewer mistakes. For Indian users and businesses, the model's true capabilities and pricing are still unclear.
Read article

More from categories

Business

View all

Technology

View all

Sports

View all