Google Unveils AI Model RT-2 to Help Robots Interpret Visual and Linguistic Patterns

RT-2 was trained by Google utilizing a combination of web and robotics data, taking advantage of developments in large language models like Bard.

By Sahil Pawar

August 3, 2023

Google unveils RT-2 AI model — Image Credits: Google

With the launch of Robotic Transformer 2 (RT-2), a new vision-language-action (VLA) model, Google is taking a significant step toward making its robots more capable. Building on its predecessor, RT-1, RT-2 gives robots a better understanding of visual and linguistic patterns, which helps them interpret instructions accurately and work out which objects best meet a particular need.

In recent tests, researchers used a robotic arm to evaluate RT-2 in a mock office kitchen. Asked to find something that could serve as an improvised hammer, the robot chose a rock; asked to select a drink for a tired person, it picked a Red Bull. The researchers also instructed the robot to move a Coke can to a photo of Taylor Swift, a task that playfully revealed the robot’s apparent fondness for the well-known singer.

Google trained RT-2 on a combination of web and robotics data, drawing on advances in large language models such as those behind Bard, Google’s chatbot. Combining this linguistic knowledge with robotics data, in particular an understanding of how robotic joints should move, proved to be a successful strategy. RT-2 can also follow instructions given in languages other than English, a notable advance in the cross-lingual capabilities of AI-driven robots.

Before VLA models such as RT-2, teaching robots required laborious, time-consuming programming for each individual task. With these models, robots can draw on a vast body of training data, allowing them to make inferences and reach decisions quickly.

The new model is not without flaws, however. In a live demonstration covered by The New York Times, the robot struggled to identify soda flavors correctly and frequently misidentified the color of fruit as white. These shortcomings underscore the continuing difficulty of readying AI technology for practical use.
