A Reinforcement Learning Method for Unmanned Aerial Vehicle Swarms Cooperatively Searching for Multiple Dynamic Targets in Uncharted Seas

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and dynamic targets, which is applied to the cooperative search of multi-dynamic targets by UAV groups in unknown sea areas. It can solve problems such as limitation, reduce the scale of search decision-making problems, and fail to satisfy multi-target searches, so as to improve search efficiency. Effect

Active Publication Date: 2022-03-22

DALIAN MARITIME UNIVERSITY

View PDF7 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The traditional search method is to use coverage search, such as back-type search, traversal search, etc. This search method generally maximizes the coverage of the task area to find as many targets as possible. In recent years, a search graph model has been established in combination with the target existence probability , using distributed model predictive control to solve, effectively reducing the solution scale of the search decision problem, but only limited to the search of static targets

For dynamic targets, the Bayesian method is used to calculate the average detection time and average detection probability, but it is only applicable to the search for a single target at sea, and cannot meet the needs of multi-target search

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0053] In order to make the technical solutions and advantages of the present invention more clear, the technical solutions in the embodiments of the present invention are clearly and completely described below in conjunction with the drawings in the embodiments of the present invention:

[0054] Such as figure 1 Figure 7 A reinforcement learning UAV swarm collaborative search method for multiple dynamic targets in unknown seas is shown, which specifically includes the following steps:

[0055] S1: Use the grid method to divide the search sea area into Lx × L y raster. Establish a multi-UAV sea area search map based on the sea surface environment, UAV dynamics, sea surface movement ship dynamics and sensor detection model information, and establish a search map Where (m, n) is the grid coordinate, k is the time, and the specific numerical calculation process is as follows:

[0056] S11: Establishing Territorial Awareness Infographic: When Drone V i Generate pheromone H ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a reinforcement learning method for collaboratively searching multiple dynamic targets in an unknown sea area by a swarm of unmanned aerial vehicles, comprising the following steps: S1: using a grid method to divide the search sea area; Concentration to establish territorial awareness information map S2: Design the Q value table according to the UAV state information and decision-making u(k); S3: According to the Q value of the current state of the UAV swarm, the Boltzmann distribution mechanism is used to select the UAV's flight route and execute it; S4: Use the search efficiency function to design a reward and punishment function for evaluating the flight state of the UAV, and update the Q value of the new state of the UAV swarm according to the reward and punishment function; S5: Update the new state of the UAV swarm to the current state, Continue to make flight route decisions and finally complete the learning of the entire Q-value table. The UAV group makes decisions based on the trained Q-table to complete the search task.

Description

technical field [0001] The invention relates to the technical field of unmanned aerial vehicle control, in particular to a method for cooperatively searching multi-dynamic targets in an unknown sea area by a group of unmanned aerial vehicles through reinforcement learning. Background technique [0002] With the rapid development of technologies such as sensors, wireless communication, and intelligent control, the functions of unmanned swarm systems are increasingly enhanced, and their application fields are expanding. Due to its scalability, strong collaboration and low loss, the unmanned swarm system has attracted more and more attention from academia, industry and national defense in its collaborative theory and application research, and the multi-UAV cooperative search system can effectively improve the search Efficiency, especially for the search of dynamic targets in complex sea conditions such as uncertainty and strong interference, has great advantages. Therefore, mul...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G05D1/12

CPCG05D1/12G05D1/0088

Inventor 岳伟关显赫刘中常王丽媛

Owner DALIAN MARITIME UNIVERSITY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A Reinforcement Learning Method for Unmanned Aerial Vehicle Swarms Cooperatively Searching for Multiple Dynamic Targets in Uncharted Seas

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology