Gemini Robotics: a new approach to AI robot control

Google has introduced Gemini Robotics, a system that brings AI agents into the physical world. The company built an advanced agentic system for robot control that reasons and plans better, interacts with humans, and uses tools such as web search.

Inside the system, two models work in tandem: Gemini Robotics-ER 1.5 and Gemini Robotics 1.5, each with a different role in robot control. The first serves as the high-level brain: it analyzes the environment and human actions or commands, creates a detailed plan for executing the task, and calls tools when necessary.

Gemini Robotics 1.5 acts as the executor, transforming instructions into precise motor commands for the robot. For example, when asked to sort trash correctly according to the user's location, the system works step by step.

Gemini Robotics-ER 1.5 analyzes the request and goes online to look up the trash sorting rules for the specific country. It then evaluates the trash in front of it and issues commands such as "bottle in the left pile, napkin in the right." The model also outputs a trace of its reasoning, which makes the system more interpretable.
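The planning step can be pictured with a small Python sketch. It is only an illustration under stated assumptions: `lookup_sorting_rules` is a hypothetical stand-in for the web search tool, the rule table and the country name are invented, and none of the names correspond to Google's actual API.

```python
from dataclasses import dataclass, field


def lookup_sorting_rules(country: str) -> dict[str, str]:
    # Hypothetical stand-in for a web search tool call. The rules below are
    # invented for illustration and match no real country's regulations.
    rules = {
        "exampleland": {"bottle": "left pile", "napkin": "right pile"},
    }
    return rules.get(country.lower(), {})


@dataclass
class Plan:
    commands: list[str] = field(default_factory=list)  # steps for the executor
    trace: list[str] = field(default_factory=list)     # human-readable reasoning


def plan_sorting(country: str, visible_items: list[str]) -> Plan:
    """Planner role: build step-by-step commands plus a reasoning trace."""
    plan = Plan()
    rules = lookup_sorting_rules(country)
    plan.trace.append(f"Looked up sorting rules for {country}: {len(rules)} found")
    for item in visible_items:
        target = rules.get(item)
        if target is None:
            plan.trace.append(f"No rule for {item}; skipping it")
            continue
        plan.commands.append(f"put {item} in {target}")
        plan.trace.append(f"{item} -> {target}")
    return plan


plan = plan_sorting("Exampleland", ["bottle", "napkin", "battery"])
print(plan.commands)
print(plan.trace)
```

Keeping the trace next to the commands is one simple way to make the plan inspectable: a person can read why each item went where it did, including the items the planner chose to skip.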

Gemini Robotics 1.5 receives the commands from ER and transforms them into precise movement trajectories. If something in the environment changes along the way, ER notices and corrects its instructions. When the robot's form changes, the whole system does not need to be adapted; adjusting the second model is enough.
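A rough way to picture this division of labor is a loop in which the planner re-reads the scene before every step, while the executor is a swappable component. The Python sketch below is hypothetical throughout: the `Executor` interface, both robot classes, and the `surprises` mechanism for simulating a changing environment are assumptions made for illustration, not part of Gemini Robotics.

```python
from typing import Optional, Protocol


class Executor(Protocol):
    """Executor role: turns one text command into robot-specific motion."""

    def execute(self, command: str) -> str: ...


class TwoArmExecutor:
    # Hypothetical robot body; a real executor would emit motor trajectories.
    def execute(self, command: str) -> str:
        return f"[two-arm] {command}"


class HumanoidExecutor:
    # A different hypothetical body that accepts the same commands.
    def execute(self, command: str) -> str:
        return f"[humanoid] {command}"


def next_command(table: list[str]) -> Optional[str]:
    """Planner role: look at the current scene and choose the next step."""
    return f"sort {table[0]}" if table else None


def run(table: list[str], executor: Executor, surprises: dict[int, str]) -> list[str]:
    """Run until the table is empty.

    `surprises` maps a step index to an item that appears at that step,
    simulating an environment that changes during the task.
    """
    table = list(table)  # work on a copy so the caller's list is untouched
    log: list[str] = []
    step = 0
    while True:
        if step in surprises:
            table.append(surprises[step])  # environment changes mid-task
        command = next_command(table)      # planner re-reads the scene
        if command is None:
            break
        log.append(executor.execute(command))
        table.pop(0)                       # the sorted item leaves the table
        step += 1
    return log


log_a = run(["bottle", "napkin"], TwoArmExecutor(), {1: "can"})
log_b = run(["bottle", "napkin"], HumanoidExecutor(), {1: "can"})
print(log_a)
print(log_b)
```

Both runs use the same planning code and handle the late-arriving item; only the executor object differs, which mirrors the idea that a new robot body requires adjusting the second model alone.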

Gemini Robotics 1.5 is a vision-language-action model: it transforms visual information and instructions into robot commands, thinks before acting, and explains its process. Gemini Robotics-ER 1.5 is responsible for planning and logical decisions; it can call digital tools and create step-by-step plans.

The models allow robots to execute complex multi-step tasks, learn from different device types and act more transparently and safely.

Author: AIvengo
I have been working with machine learning and artificial intelligence for 5 years, and this field never ceases to amaze, inspire and interest me.