ETRI-Knowledge Sharing Platform

Details

Conference Paper: Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning
Authors
홍승진, 이상광
Publication Date
2021-10
Source
International Conference on Information and Communication Technology Convergence (ICTC) 2021, pp.1697-1699
DOI
https://dx.doi.org/10.1109/ICTC52510.2021.9620745
Project
21IH4300, Game Now: Development of AI-Based Real-Time Game Analysis Technology for e-Sports Services, 이상광
Abstract
Cooperative behavior is an important skill in many real-world applications, and many recent works have used multi-agent platforms to address such applications. However, it is difficult to learn cooperative behaviors when the environment gives every agent an equal reward without considering individual contributions. In this paper, we propose a method for learning cooperative behaviors in a centralized multi-agent environment. First, we implement a reward model that predicts the average reward of all agents. Then, we use the reward model to calculate each agent's contribution. The proposed method allows the model to distinguish which agents behave better for team success. To evaluate the proposed method, we compute the average team reward in a multi-agent battle environment. Experimental results show that the proposed method outperforms the baseline by using cooperative behaviors.
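
The abstract does not say how contributions are derived from the reward model, so the following is only an illustrative sketch under stated assumptions, not the authors' implementation. It assumes a counterfactual scheme: an agent's contribution is the drop in the predicted average reward when that agent's action is replaced by a no-op. The names `toy_reward_model`, `contributions`, `shaped_rewards`, and `NOOP` are hypothetical, and the toy model stands in for a learned one.

```python
from typing import Callable, List, Sequence

NOOP = 0  # hypothetical "do nothing" action id (an assumption, not from the paper)


def toy_reward_model(actions: Sequence[int]) -> float:
    """Stand-in for a learned model that predicts the average reward of all agents.

    Hypothetical rule: action 1 is worth 1.0, every other action 0.0.
    """
    return sum(1.0 for a in actions if a == 1) / len(actions)


def contributions(
    reward_model: Callable[[Sequence[int]], float],
    actions: Sequence[int],
    noop: int = NOOP,
) -> List[float]:
    """Assumed counterfactual contribution of each agent.

    For agent i, replace its action with a no-op and measure how much the
    predicted average reward drops compared with the actual joint action.
    """
    base = reward_model(actions)
    result = []
    for i in range(len(actions)):
        counterfactual = list(actions)
        counterfactual[i] = noop
        result.append(base - reward_model(counterfactual))
    return result


def shaped_rewards(
    team_reward: float, contribs: Sequence[float], scale: float = 1.0
) -> List[float]:
    """Add each agent's scaled contribution to the shared team reward."""
    return [team_reward + scale * c for c in contribs]


if __name__ == "__main__":
    joint_actions = [1, 0, 1, 2]
    c = contributions(toy_reward_model, joint_actions)
    print(c)
    print(shaped_rewards(toy_reward_model(joint_actions), c))
```

In this toy setting, agents whose actions raise the predicted team reward receive a larger individual signal than agents whose actions leave it unchanged, which is one way to break the tie created by equal environment rewards.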