###
计算机系统应用英文版:2022,31(8):125-132
←前一篇   |   后一篇→
本文二维码信息
码上扫一扫!
基于决策知识学习的多无人机航迹协同规划
(1.解放军31008部队, 北京 100091;2.国防科技大学 系统工程学院, 长沙 410073)
Trajectory Collaborative Planning of Multi-UAV Based on Decision-making Knowledge Learning
(1.PLA 31008 Unit, Beijing 100091, China;2.College of Systems Engineering, National University of Defense Technology, Changsha 410073, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 536次   下载 1482
Received:October 29, 2021    Revised:November 29, 2021
中文摘要: 考虑无人机群体行为决策与状态变化的内在驱动, 从信息处理角度提出基于决策知识学习的多无人机航迹协同规划方法. 首先, 基于马尔科夫决策过程对无人机的行为状态进行知识表示, 形成关于连续动作空间的决策知识; 然后, 提出基于知识决策学习的深度确定性策略梯度算法, 实现无人机在决策知识层次上的协同规划. 实验结果表明: 在研发设计演示系统的基础上, 所提方法通过强化学习能够得到一个最优航迹规划策略, 同时使航迹综合评价和平均奖励收敛稳定, 为无人机任务执行提供了决策支持.
Abstract:Considering the internal driving mechanism of behavior decision-making and state changes of multiple UAVs, a collaborative trajectory planning method based on decision-making knowledge learning is proposed from the perspective of information processing. Firstly, the behavior states of UAVs are represented by knowledge on the basis of the Markov decision process, and the decision-making knowledge on continuous action space is developed. Then, a deep deterministic policy gradient (DDPG) algorithm based on decision-making knowledge learning is presented to achieve the collaborative planning of UAVs on the decision-making knowledge level. The experimental results reveal that on the basis of developing a demonstration system, the method can obtain an optimal trajectory planning strategy by reinforcement learning and can simultaneously achieve the convergence and stability of the comprehensive evaluation and average reward of trajectories, which provides decision-making support for mission execution of UAVs.
文章编号:     中图分类号:    文献标志码:
基金项目:
引用文本:
曾熠,刘丽华,李璇,杜溢墨,陈丽娜.基于决策知识学习的多无人机航迹协同规划.计算机系统应用,2022,31(8):125-132
ZENG Yi,LIU Li-Hua,LI Xuan,DU Yi-Mo,CHEN Li-Na.Trajectory Collaborative Planning of Multi-UAV Based on Decision-making Knowledge Learning.COMPUTER SYSTEMS APPLICATIONS,2022,31(8):125-132