ETRI Knowledge Sharing Platform : MuDE: Multi-Agent Decomposed Reward-Based Exploration

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Journal Article MuDE: Multi-Agent Decomposed Reward-Based Exploration

Cited 3 time in scopus

Download 799 time Share share

Authors: Byunghyun Yoo, Sungwon Yi, Hyunwoo Kim, Younghwan Shin, Ran Han, Seungwoo Seo, Hwa Jeon Song, Euisok Chung, Jeongmin Yang

Abstract: In cooperative multi-agent reinforcement learning, agents jointly optimize a centralized value function based on the rewards shared by all agents and learn decentralized policies through value function decomposition. Although such a learning framework is considered effective, estimating individual contribution from the rewards, which is essential for learning highly cooperative behaviors, is difficult. In addition, it becomes more challenging when reinforcement and punishment, help in increasing or decreasing the specific behaviors of agents, coexist because the processes of maximizing reinforcement and minimizing punishment can often conflict in practice. This study proposes a novel exploration scheme called multi-agent decomposed reward-based exploration (MuDE), which preferably explores the action spaces associated with positive sub-rewards based on a modified reward decomposition scheme, thus effectively exploring action spaces not reachable by existing exploration schemes. We evaluate MuDE with a challenging set of StarCraft II micromanagement and modified predator–prey tasks extended to include reinforcement and punishment. The results show that MuDE accurately estimates sub-rewards and outperforms state-of-the-art approaches in both convergence speed and win rates.

KSP Keywords: Cooperative behaviors, Cooperative multi-agent, Decomposition scheme, Function decomposition, Learning framework, Reward-Based, StarCraft II, Value function, convergence speed, multi-agent reinforcement learning, reinforcement learning(RL)

This work is distributed under the term of Creative Commons License (CCL)
(CC BY NC ND)

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.