Energy efficiency-oriented multi-agent deep reinforcement learning optimization method for unmanned aerial vehicle group

A technology of energy efficiency and reinforcement learning, which is applied in wireless communication, power management, electrical components, etc., can solve problems such as the inability of the algorithm to converge and converge, and achieve the effects of enhancing dynamic adaptability, improving life cycle, and improving energy efficiency
CN110958680AActive Publication Date: 2020-04-03YANGTZE NORMAL UNIVERSITY

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
YANGTZE NORMAL UNIVERSITY
Publication Date
2020-04-03

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses an energy efficiency-oriented multi-agent deep reinforcement learning optimization method for an unmanned aerial vehicle group. The method comprises the steps of adopting an improved DQN deep reinforcement learning method based on Q learning, training and updating the neural network of each intelligent agent by using historical information of the unmanned aerial vehicle cluster to obtain channel selection and power selection decisions of each intelligent agent of the unmanned aerial vehicle cluster, training the neural network by using a short-time experience playback mechanism in the training process, and maximizing the energy efficiency value of the corresponding intelligent agent by using the optimization target of each neural network. According to the invention,a distributed multi-agent deep strong chemical method is adopted, and a short-time experience playback mechanism is set to train a neural network to mine a change rule contained in a dynamic networkenvironment. The problem that a convergence solution cannot be obtained in a large state space faced by traditional reinforcement learning is solved. Multi-agent distributed cooperative learning is achieved, the energy efficiency of unmanned aerial vehicle cluster communication is improved, the life cycle of an unmanned aerial vehicle cluster is prolonged, and the dynamic adaptive capacity of an unmanned aerial vehicle cluster communication network is enhanced.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of UAV cluster communication network access, in particular to an energy efficiency-oriented multi-agent deep reinforcement learning optimization method for UAV swarms. Background technique

[0002] At present, the rapid development and application promotion of UAV technology is one of the frontier and hot issues, which has attracted extensive attention. Among them, the research on UAV swarms is the most eye-catching. UAV swarms can use low-cost UAVs to form groups according to different roles, and play a huge role in coordinated actions.

[0003] But the key to the synergy of drone swarms lies in their robust communication network. Without a communication system to support the internal members of the UAV cluster, its coordinated action is out of the question.

[0004] At the same time, the optimization of energy consumption of small UAVs, especially battery-powered UAVs, is crucial. The construction and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More