Google released the first local model Gemini Robotics On-Device
Google company presented the coolest model Gemini Robotics On-Device. This is the world’s first solution that combines computer vision, language understanding and physical actions in a single local package. Which frees robots from constant dependence on cloud computing!
The uniqueness of the new model lies in its universality. It works with both humanoid platforms and industrial two-handed manipulators. Impressive is also the system’s ability to perform the most complex two-handed operations. From manipulations with small objects to assembly of constructions and moving objects.
Learning efficiency also works excellently. The model needs only 100 demonstrations to master new actions! At the same time, the system was initially trained only on the ALOHA dataset with human instructions. But was able to transfer knowledge to diverse robotic platforms.
Google simultaneously released SDK Gemini Robotics. This is a toolkit for developers that allows customizing the model for specific tasks.
Fully autonomous operation for robots opens huge possibilities for application in conditions of unstable connection. Or for tasks requiring minimal response latency. And this could be the start of a new era of truly independent robots!
Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
AI fraudster posed as Keanu Reeves for 2.5 years and stole $160,000Dianne Ringstaff became a victim of sophisticated fraud when she was playing a mobile game and received a message followed by a video call. Artificial intelligence technologies were so advanced that the woman was absolutely certain — the real Keanu Reeves was calling her.
70-kilogram humanoid flies on turbinesItalian engineers accomplished the incredible. The metallic flying humanoid iRonCub3 with human proportions weighing 70 kg flies! 4 powerful turbines lifted the humanoid to a height of 50 cm, demonstrating technology that previously existed only in science fiction.
MIT and Microsoft exposed GPT-3.5's liesA team of scientists from MIT and Microsoft developed a methodology that allows looking behind the scenes of language models' thinking. And understanding when they lie to us. The research reveals disturbing cases of systematic mismatch between the real reasons for models' decisions and their verbal explanations.
OpenAI lures Microsoft clients with discountsOpenAI company began providing significant discounts on ChatGPT corporate subscriptions — from 10 to 20%! But discounts are available with additional investments in other OpenAI products, including Deep Research, Codex and increased API expenses. And this unprecedented move causes serious concern at Microsoft.
GigaChat lost to Claude and Gemini in Russian language on MERA benchmarkTesting GigaChat reveals the harsh truth about Russia's place in the global artificial intelligence race. Recent tests on the MERA benchmark showed results that force serious reflection. The Russian model, created specifically for working with Russian language, unexpectedly lost to foreign competitors in its own "native element".