Three-dimensional group exploration method based on multi-head attention asynchronous reinforcement learning

A technology of reinforcement learning and attention, applied in neural learning methods, design optimization/simulation, instruments, etc., can solve problems such as insufficient exploration, low sample utilization, and inability to learn, so as to achieve a good effect of perceptual data collection and solve exploration problems. Insufficient questions, enhanced exploratory effects

Active Publication Date: 2021-08-20
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. It is difficult to model complex application environments. The actual application scenarios of mobile group sensing are often dynamic and complex. For example, mobile group sensing data collection for post-disaster rescue. According to the results of environmental modeling, the current UAV swarm flight trajectory is reasonably planned for data collection tasks. Therefore, the accuracy of environmental modeling greatly affects the completion quality of group perception tasks. How to accurately and quickly create space for real application environments Die has become a big problem;
[0005] 2. Insufficient exploration of three-dimensional space. Aiming at the insufficient exploration caused by the explosion of three-dimensional space, it is necessary to design a reasonable, stable and efficient exploration mechanism to promote the drone group to quickly and efficiently perceive the entire unknown three-dimensional mobile group perception scene. Exploration to improve the quality and efficiency of drone swarm environment modeling and optimal trajectory search efforts
[0006] 3. The utilization rate of reinforcement learning samples is low. Existing reinforcement learning algorithms are faced with the problem of extremely low sample utilization rate, and cannot effectively and fully learn from the only samples. In reality, the cost of sample sources for 3D mobile group perception tasks is high , The acquisition speed is slow, how to make the algorithm more effective and fully sample and learn the existing samples without affecting the learning effect of the algorithm is an urgent problem to be solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Three-dimensional group exploration method based on multi-head attention asynchronous reinforcement learning
  • Three-dimensional group exploration method based on multi-head attention asynchronous reinforcement learning
  • Three-dimensional group exploration method based on multi-head attention asynchronous reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] The content of the present invention will be further described in detail below in conjunction with the accompanying drawings of the description. Such as figure 1 Shown, method of the present invention comprises the following steps:

[0061] Step 1. The command center initializes the benchmark exploration strategy and environmental parameters, and the UAV group performs data collection according to the changes in the perceived environment:

[0062] Step 1.1. The main process of the command center sets up a shared sample reuse cache and initializes a benchmark exploration strategy, and establishes an empty shared sample reuse cache and initializes a benchmark exploration strategy on the command center in the 3D mobile crowd sensing scene;

[0063] Step 1.2, establish multiple sub-processes, synchronize the exploration strategies of the sub-processes and initialize the environmental parameters in each sub-process. The environmental parameters include the position of the U...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a three-dimensional group exploration method based on multi-head attention asynchronous reinforcement learning. The method comprises the following steps that 1, a command center host process sets a shared sample multiplexing cache and initializes a reference exploration strategy; 2, the command center starts a sub-process; 3, the command center adopts a pixel control algorithm to optimize an unmanned aerial vehicle exploration strategy based on the shared sample multiplexing cache; 4, the command center obtains the flight path of the unmanned aerial vehicle group based on the shared sample multiplexing cache by adopting a trust domain strategy algorithm; 5, the steps 2, 3 and 4 are repeatedly executed until the action track of the unmanned aerial vehicle group does not change any more; and 6, the command center sends an optimal trajectory transfer instruction to the unmanned aerial vehicle group. According to the method, the problem of low sample sampling efficiency of a reinforcement learning algorithm is solved, a better data acquisition effect is achieved by the algorithm when the same number of samples are used for learning, and an optimal track for maximizing data acquisition is further obtained.

Description

technical field [0001] The invention belongs to the field of mobile group perception, and in particular relates to a three-dimensional group exploration method based on multi-head attention asynchronous reinforcement learning. Background technique [0002] Mobile group sensing technology is currently developing rapidly and supports the data acquisition needs of smart cities. Mobile group sensing technology uses mobile devices used by a large number of users as the basic sensing unit, and cooperates through the mobile Internet to form an interactive and participatory sensing network, realize sensing task distribution and data collection and utilization, and finally complete large-scale and complex Social sensing tasks to help professionals or the public collect data, analyze data, and share data. However, the mobile group sensing system based on mobile devices is often affected by many aspects, such as the uncertainty of user movement and the quality of mobile devices. These...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F30/27G06K9/62G06N3/04G06N3/08
CPCG06F30/27G06N3/08G06N3/045G06F18/2415
Inventor 刘驰王昊戴子彭
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products