ETRI Knowledge Sharing Platform : Reinforcement Learning-based Path Generation using Sequential Pattern Reduction and Self-directed Curriculum Learning

BROWSE

Titles

논문 검색
Type		SCI
Year	~	Keyword

Detail

List

Journal Article Reinforcement Learning-based Path Generation using Sequential Pattern Reduction and Self-directed Curriculum Learning

Cited 8 time in scopus

Download 333 time Share share

Authors: Taewoo Kim, Joo-Haeng Lee

Issue Date: 2020-08

Citation: IEEE Access, v.8, pp.147790-147807

ISSN: 2169-3536

Publisher: IEEE

Language: English

Type: Journal Article

DOI: https://dx.doi.org/10.1109/ACCESS.2020.3015245

Abstract: Recent advancements in robots and deep learning have led to active research in human-robot interaction. However, non-physical interaction using visual devices such as laser pointers has gained less attention than physical interaction using complex robots such as humanoids. Such vision-based interaction has high potential for use in recent human-robot collaboration environments such as assembly guidance, even with a minimum amount of configuration. In this paper, we introduce a simple robotic laser pointer device that follows an arbitrary planar path and is designed to be a visual instructional aid. We also propose an image-based automatic path generation method using reinforcement learning and a sequential pattern reduction technique. However, such vision-based human-robot interaction is generally performed in a dynamic environment, and it can frequently be necessary to calibrate the devices more than once. In this paper, we avoid the need for this re-calibration process through episodic randomization learning and improved learning efficiency. In particular, contrary to previous approaches, the agent controls the curriculum difficulty in a self-directed manner to determine the optimal curriculum. To our knowledge, this is the first study of curriculum learning that incorporates an explicit learning environment control signal initiated by the agent itself. Through quantitative and qualitative analyses, we show that the proposed self-directed curriculum learning method outperforms ordinary episodic randomization and curriculum learning. We hope that the proposed method can be extended to a general reinforcement learning framework.

KSP Keywords: Control Signal, Curriculum learning, Dynamic Environment, High potential, Human robot interaction(HRI), Human-robot collaboration(HRC), Image-based, Learning framework, Learning methods, Learning-based, Path generation method

This work is distributed under the term of Creative Commons License (CCL)
(CC BY)

ETRI-Knowledge Sharing Plaform

BROWSE

Titles

Detail

ETRI