JOURNAL ARTICLE

Learning agile soccer skills for a bipedal robot with deep reinforcement learning.

Published In: Science Robotics, 2024, v. 9, n. 89, p. 1
Database: Applied Science & Technology Source Ultimate
Authored By: Haarnoja, Tuomas; Moran, Ben; Lever, Guy; Huang, Sandy H.; Tirumala, Dhruva; Humplik, Jan; Wulfmeier, Markus; Tunyasuvunakool, Saran; Siegel, Noah Y.; Hafner, Roland; Bloesch, Michael; Hartikainen, Kristian; Byravan, Arunkumar; Hasenclever, Leonard; Tassa, Yuval; Sadeghi, Fereshteh; Batchelor, Nathan; Casarini, Federico; Saliceti, Stefano; Game, Charles

Abstract

We investigated whether deep reinforcement learning (deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies. We used deep RL to train a humanoid robot to play a simplified one-versus-one soccer game. The resulting agent exhibits robust and dynamic movement skills, such as rapid fall recovery, walking, turning, and kicking, and it transitions between them in a smooth and efficient manner. It also learned to anticipate ball movements and block opponent shots. The agent's tactical behavior adapts to specific game contexts in a way that would be impractical to manually design. Our agent was trained in simulation and transferred to real robots zero-shot. A combination of sufficiently high-frequency control, targeted dynamics randomization, and perturbations during training enabled good-quality transfer. In experiments, the agent walked 181% faster, turned 302% faster, took 63% less time to get up, and kicked a ball 34% faster than a scripted baseline. Editor's summary: Generating robust motor skills in bipedal robots in the real world is challenging because of the inability of current control methods to generalize to specific tasks. Haarnoja et al. developed a deep reinforcement learning–based framework for full-body control of humanoid robots, enabling a game of one-versus-one soccer. The robots exhibited emergent behaviors in the form of dynamic motor skills such as the ability to recover from falls and also tactics like defending the ball against an opponent. The robot movements were faster when using their framework than a scripted baseline controller and may have potential for more complex multirobot interactions. —Amos Matsiko [ABSTRACT FROM AUTHOR]
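The percentage figures in the abstract can be read as ratios against the scripted baseline. The sketch below is only a reading aid: it assumes "X% faster" means a speed of (1 + X/100) times the baseline and "X% less time" means a duration of (1 - X/100) times the baseline. The abstract does not spell out these definitions, and the function names are hypothetical.

```python
def faster_to_speed_ratio(percent_faster: float) -> float:
    """Speed relative to baseline, assuming 'X% faster' = (1 + X/100) x baseline."""
    return 1.0 + percent_faster / 100.0


def less_time_to_duration_ratio(percent_less: float) -> float:
    """Duration relative to baseline, assuming 'X% less time' = (1 - X/100) x baseline."""
    return 1.0 - percent_less / 100.0


# Figures quoted in the abstract (agent vs. scripted baseline).
walk_ratio = faster_to_speed_ratio(181)          # walking speed ratio
turn_ratio = faster_to_speed_ratio(302)          # turning speed ratio
kick_ratio = faster_to_speed_ratio(34)           # ball speed ratio after a kick
getup_duration = less_time_to_duration_ratio(63) # get-up time ratio
getup_speedup = 1.0 / getup_duration             # equivalent speed-up factor

if __name__ == "__main__":
    print(f"walking:    {walk_ratio:.2f}x baseline speed")
    print(f"turning:    {turn_ratio:.2f}x baseline speed")
    print(f"kicking:    {kick_ratio:.2f}x baseline ball speed")
    print(f"getting up: {getup_duration:.2f}x baseline time "
          f"(about {getup_speedup:.1f}x as quick)")
```

Under these assumptions, "63% less time" and "181% faster" describe improvements of similar size (roughly 2.7x and 2.8x), even though the raw percentages look very different.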

Additional Information

Source: Science Robotics. 2024/04, Vol. 9, Issue 89, p1
Document Type: Article
Subject Area: Engineering
Publication Date: 2024
ISSN: 2470-9476
DOI: 10.1126/scirobotics.adi8022
Accession Number: 176964829
Copyright Statement:Copyright of Science Robotics is the property of American Association for the Advancement of Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
