Received: April 28, 2024  Revised: June 17, 2024
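Abstract (translated from the Chinese original): Deep reinforcement learning algorithms are increasingly applied to UAV trajectory planning, but many studies do not consider complex scenarios that change randomly. To address this problem, this paper proposes PP-CMNTD3, an improved algorithm based on TD3. It introduces a simple and effective prior policy and, drawing on the idea of artificial potential fields, designs a dense reward that better guides the UAV to avoid obstacles and quickly approach the target point. Simulation results show that the improvements effectively raise both the training efficiency of the network and its trajectory planning performance in complex scenarios. The algorithm also adjusts its policy flexibly under different initial battery levels, striking an effective balance between energy consumption and rapid arrival at the destination.
Keywords (translated from the Chinese original): deep reinforcement learning; UAV; trajectory planning; artificial potential field; twin delayed deep deterministic policy gradient algorithm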
Abstract: Deep reinforcement learning algorithms are increasingly used in UAV trajectory planning, but many studies do not consider complex, randomly changing scenarios. To address this problem, this study proposes PP-CMNTD3, an improved algorithm based on TD3. It introduces a simple and effective prior policy and draws on the idea of artificial potential fields to design dense rewards, so that the UAV is better guided to avoid obstacles and approach the target point quickly. Simulation results show that these improvements raise both the training efficiency of the network and its trajectory planning performance in complex scenarios. The algorithm also adjusts its policy flexibly under different initial battery levels, achieving an effective balance between energy consumption and rapid arrival at the destination.
Keywords: deep reinforcement learning; unmanned aerial vehicle (UAV); trajectory planning; artificial potential field; twin delayed deep deterministic policy gradient (TD3) algorithm
Citation:
牟文心,时宏伟.基于改进TD3算法的无人机轨迹规划.计算机系统应用,,():1-13
MU Wen-Xin, SHI Hong-Wei. UAV Trajectory Planning Based on Improved TD3 Algorithm. COMPUTER SYSTEMS APPLICATIONS,,():1-13