Google DeepMind has unveiled Gemini Robotics On-Device, an AI model that could reshape the robotics landscape.
The core innovation? The model runs entirely offline, directly on the robot's hardware, with no cloud connection required. Because it combines visual recognition, language comprehension, and action execution in a single model running locally, robots can respond to human commands in real time.
Designed as a vision-language-action (VLA) foundation model for bi-arm robots, Gemini Robotics On-Device interprets natural language instructions directly and translates them into corresponding robot actions, a major step forward in human-robot interaction.
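To make this concrete, the sketch below shows the general shape of an on-device VLA control loop: sense locally, predict a short chunk of actions from the current camera frame and the instruction, execute, repeat. All class and method names here are illustrative stand-ins, not the actual Gemini Robotics SDK; the point is only that every step runs locally, so no network round-trip sits in the control path.

```python
import numpy as np

class DummyVLAModel:
    """Stand-in for an on-device vision-language-action (VLA) model:
    maps a camera frame plus a text instruction to a short chunk of
    low-level actions (here, joint commands for two 7-DoF arms)."""
    def predict_actions(self, image: np.ndarray, instruction: str) -> np.ndarray:
        return np.zeros((10, 14))  # 10 timesteps x 14 joint commands

class DummyRobot:
    """Stand-in for a bi-arm robot interface."""
    def get_camera_frame(self) -> np.ndarray:
        return np.zeros((224, 224, 3), dtype=np.uint8)  # local camera read

    def send_joint_commands(self, action: np.ndarray) -> None:
        pass  # a real driver would stream these to the motor controllers

def control_loop(model, robot, instruction: str, cycles: int = 5) -> None:
    """Closed-loop control: sense, predict an action chunk, execute.
    Everything runs on the robot, so latency is independent of any network."""
    for _ in range(cycles):
        frame = robot.get_camera_frame()
        actions = model.predict_actions(frame, instruction)
        for action in actions:
            robot.send_joint_commands(action)

control_loop(DummyVLAModel(), DummyRobot(), "fold the towel in half")
```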
This on-device operation gives the model a significant edge in scenarios that demand real-time performance and reliability, such as medical procedures, disaster relief, and factory automation. Avoiding cloud latency and the risk of dropped connections is a critical advantage in these high-stakes environments.
The model also boasts impressive platform adaptability, requiring minimal additional training to integrate with a variety of robotic hardware setups, a crucial factor in accelerating the broader adoption of robotics technology. However, challenges remain in refining safety protocols and advanced logical reasoning for complex environments.
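Platform adaptation of this kind is typically done by fine-tuning a pretrained model on a small set of teleoperated demonstrations collected on the new hardware. The sketch below illustrates that idea as a plain behavior-cloning loop; the tiny MLP and fabricated demonstration data are stand-ins for the real model and demos, not Google's released tooling.

```python
import torch
import torch.nn as nn

# Toy policy head: in a real setup the backbone would be the pretrained
# VLA model; a small MLP stands in here so the loop is runnable.
policy = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 14))
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-4)

# Fabricated demonstration set: 50 (observation embedding, action) pairs,
# mimicking a small batch of teleoperated demos on the new platform.
obs = torch.randn(50, 64)
actions = torch.randn(50, 14)  # target joint commands for two 7-DoF arms

for epoch in range(10):
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(policy(obs), actions)  # behavior cloning
    loss.backward()
    optimizer.step()
```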
Currently, the released model is built on the Gemini 2.0 architecture and does not yet incorporate the newer Gemini 2.5 advances; its practical deployment is still in the testing phase. The launch of Gemini Robotics On-Device marks a significant strategic move by Google into the general-purpose robotics AI market, setting the stage for direct competition with industry rivals such as Nvidia's GR00T; the new model also succeeds DeepMind's own earlier RT-2 vision-language-action work.