ETRI-Knowledge Sharing Platform


Conference Paper C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning
Cited 14 times in Scopus
Authors
Taewoo Kim, Joo-Haeng Lee
Issue Date
2020-06
Citation
International Conference on Robotics and Automation (ICRA) 2020, pp.8425-8432
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICRA40945.2020.9196948
Abstract
Motion retargeting between heterogeneous polymorphs with different sizes and kinematic configurations requires a comprehensive knowledge of (inverse) kinematics. Moreover, it is non-trivial to provide a kinematic independent general solution. In this study, we developed a cyclic three-phase optimization method based on deep reinforcement learning for human-robot motion retargeting. The motion retargeting learning is performed using refined data in a latent space by the cyclic and filtering paths of our method. In addition, the human-in-the-loop based three-phase approach provides a framework for the improvement of the motion retargeting policy by both quantitative and qualitative manners. Using the proposed C-3PO method, we were successfully able to learn the motion retargeting skill between the human skeleton and motion of the multiple robots such as NAO, Pepper, Baxter and C-3PO.
KSP Keywords
Deep reinforcement learning, Different sizes, Human Skeleton, Human-Robot, Latent space, Motion Retargeting, PO method, Phase Optimization, Reinforcement Learning (RL), Robot motion, Three phase