Google DeepMind shows Gemini Robotics in action – WATCH what they can do

Google DeepMind has unveiled two innovative AI models as part of its Gemini Robotics series, designed to significantly enhance the capabilities of general-purpose robots. The models, known as Gemini Robotics-ER 1.5 and Gemini Robotics 1.5, work in tandem to improve reasoning, vision, and action in real-world scenarios. The Gemini Robotics-ER 1.5 acts as the planner or orchestrator, while Gemini Robotics 1.5 executes tasks based on natural language commands. This two-model approach aims to overcome the limitations of previous AI systems, which often combined planning and execution in a single unit, leading to potential errors and delays. Gemini Robotics-ER 1.5 is highlighted as a vision-language model (VLM) that excels in advanced reasoning and tool integration. It is capable of generating multi-step plans for tasks and has shown strong performance in spatial understanding benchmarks. Notably, this model can leverage external tools, including Google Search, to enhance decision-making in physical environments. Once a plan is established, the Gemini Robotics 1.5, a vision-language-action (VLA) model, translates instructions and visual inputs into precise motor commands, enabling the robot to execute the task. This model evaluates the most efficient route to complete an action while providing explanations of its decision-making process in natural language. This sophisticated system is engineered to empower robots to manage complex, multi-step commands seamlessly. For instance, a robot could efficiently sort items into compost, recycling, and trash bins by first consulting local recycling guidelines online, analyzing the items, planning the sorting process, and executing the actions accordingly. DeepMind indicates that these AI models are adaptable to various robot shapes and sizes due to their spatial awareness and flexible design. Currently, the orchestrator model, Gemini Robotics-ER 1.5, is available to developers through the Gemini API in Google AI Studio, while the VLA model is accessible to select partners. This advancement represents a significant step towards integrating generative AI into robotics, transitioning from traditional interfaces to natural language-driven control, while also separating planning from execution to minimize errors.

Sources : Mint

Published On : Sep 27, 2025, 12:15

Startups

AI Startups Experience Explosive Revenue Growth Amid Technology Boom

As businesses, both established and emerging, strive to harness the potential of artificial intelligence, numerous AI st...

TechCrunch | Jul 08, 2026, 15:50

AI Startups Experience Explosive Revenue Growth Amid Technology Boom

Gaming

Join the Quest: Gamers Unite to Unearth Revolutionary War Treasures

In an innovative twist on mobile gaming, players are being invited to embark on a thrilling treasure hunt for lost artif...

CNN | Jul 08, 2026, 11:55

Join the Quest: Gamers Unite to Unearth Revolutionary War Treasures

Market Turbulence: U.S.-Iran Tensions and Tech Giants Face Investor Scrutiny

In a day marked by heightened geopolitical tensions and mixed corporate earnings, investors are bracing for a turbulent ...

CNBC | Jul 08, 2026, 12:25

Market Turbulence: U.S.-Iran Tensions and Tech Giants Face Investor Scrutiny

Startups

Blue Origin Seeks $10 Billion Funding Boost Amid Challenges

Blue Origin, the space venture founded by billionaire Jeff Bezos, is in the process of securing $10 billion in funding, ...

TechCrunch | Jul 08, 2026, 14:15

Blue Origin Seeks $10 Billion Funding Boost Amid Challenges

Automotive

Waymo Expands Driverless Services to Four New U.S. Cities

Waymo is set to enhance its presence in the autonomous vehicle market by launching driverless rides in four additional c...

CNBC | Jul 08, 2026, 14:15

Waymo Expands Driverless Services to Four New U.S. Cities

View All News

High-quality, Cost-effective IT Outsourcing

let’s grow together!

portfolio

case study

follow us on

follow us on

Google DeepMind shows Gemini Robotics in action – WATCH what they can do

AI Startups Experience Explosive Revenue Growth Amid Technology Boom

Join the Quest: Gamers Unite to Unearth Revolutionary War Treasures

Market Turbulence: U.S.-Iran Tensions and Tech Giants Face Investor Scrutiny

Blue Origin Seeks $10 Billion Funding Boost Amid Challenges

Waymo Expands Driverless Services to Four New U.S. Cities

Collaborate with Benzatine Infotech

High-quality, Cost-effective IT Outsourcing

let’s grow together!

portfolios

case study

follow us on

follow us on

portfolio

case study

follow us on

follow us on

Google DeepMind shows Gemini Robotics in action – WATCH what they can do

AI Startups Experience Explosive Revenue Growth Amid Technology Boom

Join the Quest: Gamers Unite to Unearth Revolutionary War Treasures

Market Turbulence: U.S.-Iran Tensions and Tech Giants Face Investor Scrutiny

Blue Origin Seeks $10 Billion Funding Boost Amid Challenges

Waymo Expands Driverless Services to Four New U.S. Cities

Collaborate with Benzatine Infotech