Newzvia

Artificial Intelligence | Google's Gemini Pro 1.5: Smarter AI for Businesses, Not Yet for All

Pankaj Mukherjee, Senior Technology Correspondent · AI, startups & MeitY policy

Quick summary

Google DeepMind today launched Gemini Pro 1.5, an AI model that now understands text, images, sound, and video much better. It mainly targets large companies, raising questions about its accessibility and relevance for Indian startups and developers.

Google DeepMind released a new AI model today, named Gemini Pro 1.5. The company says it’s much better at handling different kinds of information, such as pictures, sound, and even video. The model is aimed mostly at big businesses, per the official release.

What It Does Differently

This new version, Gemini Pro 1.5, can reason across different types of data. It understands text, images, audio, and video inputs. This is called “multimodal understanding.” It means the AI can look at a video, listen to its sound, and read any captions, then make sense of all this information together.

The company claims it does this with greater accuracy and context awareness. This helps it tackle complex tasks for companies. For example, it could analyze security footage or customer service calls.

The India Question

Google DeepMind says Gemini Pro 1.5 is for enterprise use. This usually means it's for larger companies. But what does this mean for India? Our startups and smaller businesses often drive innovation. Will they get easy access?

The announcement didn't share specific details on pricing or regional availability for India. We don't know if it will cost ₹100 or ₹10,000 to use this advanced model. Support for Indian languages is also not confirmed. This matters for local developers building solutions here.

What Wasn't Said

The news focused on “significantly improved” understanding. But Google DeepMind didn't release detailed benchmarks. These help us compare its real performance against other models.

OpenAI recently rolled out its GPT-5 API, claiming “unprecedented advancements” in reasoning. Anthropic also expanded access to Claude 3 Opus for scientific research. All these companies make big claims. Without public data, it's hard to verify how much better Gemini Pro 1.5 truly is. We also don't know when regular developers can get their hands on it.

The race to build smarter AI continues. Each new model promises more. Companies need to see if these promises deliver real value, especially in diverse markets like India. We'll be watching to see how Gemini Pro 1.5 performs in actual business settings.

Key Takeaways

  • Google DeepMind launched Gemini Pro 1.5 with enhanced multimodal understanding.
  • The AI can process text, images, audio, and video inputs together.
  • It's designed for enterprise applications, leaving questions about access for smaller Indian businesses.
  • Specific performance data and pricing for India are still unknown.

People also ask

What is multimodal AI?
AI that processes and understands varied data inputs: text, images, sound, and video.
What does "enterprise applications" mean?
Google DeepMind announced the model on 2026-05-12. In that announcement, the term means the AI serves large organizations with complex business problems, such as analyzing vast customer data or security footage.
Is it available to the general public?
No — the announcement targets enterprise applications. General public access is currently unconfirmed.
What about India?
Specific pricing and availability for India were not mentioned in the official release; these remain unconfirmed.
