Multi-unmanned aerial vehicle motion planning method based on artificial potential field method and MADDPG

An artificial potential field method and multi-UAV technology, applied in the field of multi-UAV motion planning, can solve the problems of too large joint space dimension, discrete action space, and too low dimension, so as to speed up the convergence rate and accelerate the Training speed, the effect of good application prospects

Active Publication Date: 2021-06-11
NORTHWESTERN POLYTECHNICAL UNIV
View PDF8 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The public patent CN109059931A proposes a motion planning method based on multi-agent reinforcement learning. Based on the most basic reinforcement learning Q-Learning algorithm, the global agent and local agent with complementary functions are used to explore and utilize the flight environment, respectively. Corresponding to global path planning and local path planning, however, this method requires the action space to be discrete and the dimension cannot be too high, and the obtained strategy is discrete, which has poor adaptability in a dynamic and complex environment; while achieving efficient continuous decision-making, As an excellent multi-agent deep reinforcement learning algorithm, the MADDPG algorithm adopts the framework of centralized learning and decentralized action, without the need to establish realistic communication rules, and can solve the problem of non-stationarity of the environment well, providing an excellent solution for multi-agent reinforcement learning. basic framework
However, when this algorithm is applied to the motion planning of a large number of agents, there is a common problem that the dimension of the joint space is too large, and the training period is significantly longer, making it difficult to converge, and even the stability and dynamic adaptability of the training model are extremely poor. problems, it is difficult to apply to dynamic and complex environments, and has certain limitations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-unmanned aerial vehicle motion planning method based on artificial potential field method and MADDPG
  • Multi-unmanned aerial vehicle motion planning method based on artificial potential field method and MADDPG
  • Multi-unmanned aerial vehicle motion planning method based on artificial potential field method and MADDPG

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0169] The multi-UAV motion planning method based on the artificial potential field method and MADDPG proposed by the present invention initializes the environment and neural network parameters at the beginning of each round, and then the multi-UAV selects actions to move and change positions to obtain a new state. And the experience of each round is saved in the experience pool as a learning sample, and the parameters of the neural network are updated iteratively. After the training is completed, the network parameters are saved and loaded to multiple UAVs in a specific environment for testing to verify the adaptability and efficiency of the planning strategy. In the training process of the present invention, the action selection depends on the parameter p. The probability of multi-UAV using the artificial potential field method to select an action is the parameter p, and the probability of using the exploration or strategy network to select the action is the parameter 1-p. Th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-unmanned aerial vehicle motion planning method based on an artificial potential field method and MADDPG. According to the method, high-quality experience of successfully planning a plurality of unmanned aerial vehicles to a target through the artificial potential field method is added on the basis of the original multi-unmanned aerial vehicle exploration environment experience, through the MADDPG algorithm training, samples are collected from exploration environment experience and high-quality experience at a certain probability, state information and environment information of each unmanned aerial vehicle serve as input of a neural network, the speeds of the multiple unmanned aerial vehicles serve as output, training of a motion planning strategy is completed, the multi-unmanned aerial vehicle autonomous obstacle avoidance in a complex environment is realized, and the target position is safely and quickly reached. According to the method, the Q values of the multiple unmanned aerial vehicles in different states and different actions are fully learned, the robustness of the strategy is improved, an excellent strategy with higher adaptability and higher flexibility is trained, and the method has a good application prospect in a scene of cooperative motion planning of the multiple unmanned aerial vehicles.

Description

technical field [0001] The invention belongs to the technical field of unmanned aerial vehicles, and in particular relates to a multi-unmanned aerial vehicle motion planning method. Background technique [0002] With the continuous development and improvement of science and technology, the multi-UAV motion planning technology, that is, the path planning technology that realizes the efficient obstacle avoidance of multiple UAVs, has been widely used in all aspects of human production and life. play an important role in revolutionizing industrial efficiency. The traditional multi-UAV motion planning method is dedicated to using some search algorithms and planning algorithms to calculate a better path when the environment is completely known and basically fixed, and the UAV positioning is accurate and meets the relevant motion trajectory. , such as A* algorithm, artificial potential field method (Artificial Potential Field, referred to as APF) and vector field histogram algori...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10
CPCG05D1/101
Inventor 万开方武鼎威高晓光胡子剑
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products