ETRI Knowledge Sharing Platform : Combining Reward Shaping and Curriculum Learning for Training Agents with High Dimensional Continuous Action Spaces

Conference Paper: Combining Reward Shaping and Curriculum Learning for Training Agents with High Dimensional Continuous Action Spaces

Cited 9 times in Scopus

Citation: International Conference on Information and Communication Technology Convergence (ICTC) 2018, pp.1391-1393

Abstract: The need to train agents with high-dimensional continuous action spaces will grow as robot hardware such as robotic arms and humanoid robots becomes more sophisticated. However, training such agents is difficult and time-consuming. To tackle the problem, we combine reward shaping and curriculum learning. More specifically, a reward is provided to the agent at every step it takes, and the difficulty of the problem gradually increases as the agent learns. Both the reward function and the curriculum are designed to help the agent achieve its objective. The simulation results demonstrate that the proposed scheme outperforms the methods it is compared against.
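
The two ideas named in the abstract, a reward at every step and a difficulty that rises with the agent's progress, can be illustrated with a minimal Python sketch. This is not the paper's implementation: the abstract does not give the reward function or the curriculum rule, so the distance-based reward, the success-rate threshold, and every name and parameter below (shaped_reward, Curriculum, step_scale, goal_bonus, window, threshold) are assumptions made purely for illustration.

```python
from collections import deque


def shaped_reward(prev_distance, distance, reached, step_scale=1.0, goal_bonus=10.0):
    """Hypothetical dense reward: progress toward the goal on every step,
    plus a one-off bonus when the goal is reached."""
    reward = step_scale * (prev_distance - distance)
    if reached:
        reward += goal_bonus
    return reward


class Curriculum:
    """Hypothetical curriculum: raise the difficulty level once the success
    rate over the last `window` episodes reaches `threshold`."""

    def __init__(self, max_level=5, window=20, threshold=0.8):
        self.level = 0
        self.max_level = max_level
        self.window = window
        self.threshold = threshold
        self.results = deque(maxlen=window)

    def record(self, success):
        """Store one episode outcome and return the (possibly updated) level."""
        self.results.append(bool(success))
        window_full = len(self.results) == self.window
        if window_full and sum(self.results) / self.window >= self.threshold:
            if self.level < self.max_level:
                self.level += 1
                # Start a fresh window so the new level is judged on its own.
                self.results.clear()
        return self.level
```

In a training loop under these assumptions, shaped_reward would be called once per environment step and Curriculum.record once per episode, with the returned level used to configure the next episode (for example, a larger distance to the target).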

KSP Keywords: Continuous action, Curriculum learning, High-dimensional, Humanoid Robot, reward function, robot hardware, robotic arm, simulation results

