Google released its first local robotics model, Gemini Robotics On-Device

Google has presented an impressive new model, Gemini Robotics On-Device. This is the world’s first solution that combines computer vision, language understanding, and physical actions in a single local package, which frees robots from constant dependence on cloud computing!

What makes the new model unique is its versatility. It works with both humanoid platforms and industrial two-handed manipulators. Equally impressive is the system’s ability to perform highly complex two-handed operations, from manipulating small objects to assembling structures and moving items.

The model also learns efficiently. It needs only 100 demonstrations to master new actions! Moreover, the system was initially trained only on the ALOHA dataset with human instructions, yet it was able to transfer that knowledge to diverse robotic platforms.

At the same time, Google released the Gemini Robotics SDK, a toolkit that lets developers customize the model for specific tasks.

Fully autonomous operation opens up huge possibilities for using robots where the connection is unstable, or for tasks that require minimal response latency. This could be the start of a new era of truly independent robots!

Author: AIvengo
I have been working with machine learning and artificial intelligence for 5 years, and this field never ceases to amaze, inspire, and interest me.
