###

计算机系统应用英文版:2023,32(8):95-104

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于多智能体深度强化学习的协作导航应用

马佩鑫¹, 程钰¹, 侯健¹, 范庆来²

(1.浙江理工大学计算机科学与技术学院, 杭州 310018;2.浙江浙石油综合能源销售有限公司, 杭州 310012)

Cooperative Navigation Application Based on Multi-agent Deep Reinforcement Learning

MA Pei-Xin¹, CHENG Yu¹, HOU Jian¹, FAN Qing-Lai²

(1.School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China;2.Zhejiang Petroleum Comprehensive Energy Sales Co. Ltd., Hangzhou 310012, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 766次下载 1833次
Received:January 18, 2023 Revised:February 23, 2023

中文摘要: 多机器人协作导航目前广泛应用于搜索救援、物流等领域, 协作策略与目标导航是多机器人协作导航面临的主要挑战. 为提高多个移动机器人在未知环境下的协作导航能力, 本文提出了一种新的分层控制协作导航(hierarchical control cooperative navigation, HCCN) 策略, 利用高层目标决策层和低层目标导航层, 为每个机器人分配一个目标点, 并通过全局路径规划和局部路径规划算法, 引导智能体无碰撞地到达分配的目标点. 通过Gazebo平台进行实验验证, 结果表明, 文中所提方法能够有效解决协作导航过程中的稀疏奖励问题, 训练速度至少可提高16.6%, 在不同环境场景下具有更好的鲁棒性, 以期为进一步研究多机器人协作导航提供理论指导, 应用至更多的真实场景中.

中文关键词: 多机器人系统|协作导航|未知环境|多智能体深度强化学习|课程学习

Abstract:Multi-robot collaborative navigation is currently widely used in search and rescue, logistics, and other fields. Cooperative strategy and target navigation are the main challenges faced by multi-robot collaborative navigation. To improve the cooperative navigation ability of multiple mobile robots in an unknown environment, this study proposes a new hierarchical control cooperative navigation (HCCN) strategy. The high-level target decision layer and low-level target navigation layer are applied to assign a target point to each robot, and the global path planning and local path planning algorithms are adopted to guide the agent to reach the assigned target point without collision. Experimental verification is carried out on the Gazebo platform. The results show that the proposed method can effectively solve the sparse reward problem in cooperative navigation, and the training speed can be improved by at least 16.6%. It has better robustness in different scenarios. It is expected to provide theoretical guidance for further research on multi-robot cooperative navigation and be applied to more real scenarios.

keywords: multi-robot systems|cooperative navigation|unknown environment|multi-agent deep reinforcement learning|curriculum learning

文章编号： 中图分类号： 文献标志码：

基金项目:空间智能控制技术国防科技重点实验室2022年度国防科工局稳定支持科研项目(HTKJ2022KL502016)

引用文本：
马佩鑫,程钰,侯健,范庆来.基于多智能体深度强化学习的协作导航应用.计算机系统应用,2023,32(8):95-104
MA Pei-Xin,CHENG Yu,HOU Jian,FAN Qing-Lai.Cooperative Navigation Application Based on Multi-agent Deep Reinforcement Learning.COMPUTER SYSTEMS APPLICATIONS,2023,32(8):95-104

Author Name	Affiliation	E-mail
MA Pei-Xin	School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
CHENG Yu	School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
HOU Jian	School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
FAN Qing-Lai	Zhejiang Petroleum Comprehensive Energy Sales Co. Ltd., Hangzhou 310012, China	Sheldonwongww@163.com