Abstract: To address the low learning efficiency and slow convergence caused by the complex relationships among agents in multi-agent reinforcement learning, this study proposes MADDPG-Attention, an algorithm with a two-level attention mechanism. The mechanism adds hard and soft attention to the Critic network of the MADDPG algorithm so that agents can identify and learn usable experience from one another, improving their mutual learning efficiency. Because a single-level soft attention mechanism assigns learning weights even to completely unrelated agents, hard attention is first applied to decide whether learning between a pair of agents is necessary, pruning agents whose information is irrelevant. Soft attention then evaluates the importance of learning between the remaining pairs of agents and assigns learning weights according to this importance, so that each agent learns from the agents with useful experience. Experiments on the cooperative navigation task in the multi-agent particle environments show that MADDPG-Attention captures these complex relationships more clearly, achieving a success rate above 90% in all three scenarios while improving learning efficiency and accelerating convergence.
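The two-level selection described above can be sketched as follows. This is a minimal illustrative example, not the paper's implementation: in MADDPG-Attention the hard gate and the attention scores are produced by learned networks inside the Critic, whereas here a fixed dot-product score and a hypothetical threshold stand in for them, and plain numpy vectors stand in for agent embeddings.

```python
import numpy as np

def two_level_attention(query, others, hard_threshold=0.0):
    """Hard attention prunes irrelevant agents; soft attention
    weights the survivors and aggregates their features.

    query  : (d,)   embedding of the current agent
    others : (n, d) embeddings of the other agents
    """
    # Hard attention: a binary gate deciding whether learning from
    # each agent is necessary at all. (Learned in the paper; a
    # dot-product score with a fixed threshold is a stand-in here.)
    scores = others @ query
    gate = scores > hard_threshold
    if not gate.any():
        # Every agent was pruned: nothing to learn from.
        return np.zeros(len(others)), np.zeros_like(query)

    # Soft attention: softmax only over the agents that survived
    # the hard gate; pruned agents receive exactly zero weight.
    masked = np.where(gate, scores, -np.inf)
    exp = np.exp(masked - masked.max())
    weights = exp / exp.sum()

    # Aggregate the other agents' features by importance.
    context = weights @ others
    return weights, context
```

For example, an agent whose embedding is orthogonal or opposed to the query is cut by the hard gate and contributes zero weight, while the soft attention distributes the remaining weight among the relevant agents in proportion to their scores.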