Optimized Architecture for Cooperative Multi-agent Reinforcement Learning

doi:10.15888/j.cnki.csa.009636

WeChat

Mobile website

Home > Archive>Volume 33, Issue 11, 2024 >79-89. DOI:10.15888/j.cnki.csa.009636

PDF HTML XML Export Cite reminder

Optimized Architecture for Cooperative Multi-agent Reinforcement Learning
DOI:
                        10.15888/j.cnki.csa.009636
                    
CSTR:
                        
Author:
                        
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Numerous real-world tasks require the collaboration of multiple agents, often with limited communication and incomplete observations. Deep multi-agent reinforcement learning (Deep-MARL) algorithms show remarkable effectiveness in tackling such challenging scenarios. Among these algorithms, QTRAN and QTRAN++ are representative approaches capable of learning a broad class of joint-action value functions with strong theoretical guarantees. However, the performance of QTRAN and QTRAN++ is hindered by their reliance on a single joint action-value estimator and their neglect of preprocessing agent observations. This study introduces a novel algorithm called OPTQTRAN, which significantly improves upon the performance of QTRAN and QTRAN++. Firstly, the study proposes a dual joint action-value estimator structure that leverages a decomposition network module to compute additional joint action-values. To ensure accurate computation of joint action-value estimators, it designs an adaptive network that facilitates efficient value function learning. Additionally, it introduces a multi-unit network that groups agent observations into different units for effective estimation of utility functions. Extensive experiments conducted on the widely-used StarCraft benchmark across diverse scenarios demonstrate that the proposed approach outperforms state-of-the-art MARL methods.

Reference

Cited by

Get Citation

刘玮,程旭,李浩源.优化的协作多智能体强化学习架构.计算机系统应用,2024,33(11):79-89

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:February 27,2024
Revised:May 06,2024
Adopted:
Online: September 24,2024
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063