Google DeepMind Enables Robots To Perform Novel Tasks

New Delhi, July 29 : Google has demonstrated its first vision-language-action (VLA) model for robot control that showed improved generalisation capabilities and semantic and visual understanding beyond the robotic data it was exposed to.

This includes interpreting new commands and responding to user commands by performing rudimentary reasoning, such as reasoning about object categories or high-level descriptions.

The Robotic Transformer 2 (RT-2) is a novel vision-language-action (VLA) model that learns from both web and robotics data, and translates this knowledge into generalised instructions for robotic control, according to Google DeepMind.

A traditional robot can pick up a ball and stumble when picking up a cube.

RT-2’s flexible approach enables a robot to train on picking up a ball and can figure out how to adjust its extremities to pick up a cube or another toy it’s never seen before.

“We also show that incorporating chain-of-thought reasoning allows RT-2 to perform multi-stage semantic reasoning, like deciding which object could be used as an improvised hammer (a rock), or which type of drink is best for a tired person (an energy drink),” said the DeepMind team.

The latest model builds upon Robotic Transformer 1 (RT-1) that was trained on multi-task demonstrations.

The team performed a series of qualitative and quantitative experiments on RT-2 models, on over 6,000 robotic trials.

“Across all categories, we observed increased generalisation performance (more than 3x improvement) compared to previous baselines,” the team said.

The RT-2 model shows that vision-language models (VLMs) can be transformed into powerful vision-language-action (VLA) models, which can directly control a robot by combining VLM pre-training with robotic data.

“RT-2 is not only a simple and effective modification over existing VLM models, but also shows the promise of building a general-purpose physical robot that can reason, problem solve, and interpret information for performing a diverse range of tasks in the real-world,” said Google DeepMind.

 na/

#Google #DeepMind #enables #robots #perm #tasks #Delhi #Delhi #New Delhi

Google Deepmind Enables Robots To Perform Novel Tasks

Google DeepMind enables robots to perform novel tasks

Pranav Mohanlal Works Without Pay In Spain For Food And Lodging

Ias Officer Ila Tripathi Inspirational Success Story Details Inside Goes Viral

Racist Attacks On Indian American Leader Ajay Bhutoria

Varun Tej Best Choice Matka

Prabhas Troubling With Movies Selections

Ashok Galla Shocking Comments About Gautam Details Inside Goes Viral In Social Media

The Raja Saab Budget Details Out

Manchu Lakshmi Interesting Comments On Prabhas

Pranav Mohanlal Works Without Pay In Spain For Food And Lodging

Ias Officer Ila Tripathi Inspirational Success Story Details Inside Goes Viral

Pranav Mohanlal Works Without Pay In Spain For Food And Lodging

Ias Officer Ila Tripathi Inspirational Success Story Details Inside Goes Viral

Racist Attacks On Indian American Leader Ajay Bhutoria

Varun Tej Best Choice Matka

Prabhas Troubling With Movies Selections

Ias Officer Ila Tripathi Inspirational Success Story Details Inside Goes Viral

Two Dozens Plastic Drums Being Tied Carried Roof Of A Maruti 800 Car Video Vrial

Zomato Food Rescue Will It Help Minimise Food Wastage

Viral Video Drunk Constable Unzips And Urinates In Middle Of Road Outside Police Station In Agra

Woman Being Threatened In The Name Of Cm Revanth Reddy Brother Video Viral

Why Celebrities Are Not Interested In Bigg Boss 7 Telugu

If You Do This Once In 15 Days Your Liver Will Be Clean

Boiled Chickpeas Health Benefits

How Long Does It Take To Boil An Egg

Vishnu Hurdles In Maa Industry