Check patentability & draft patents in minutes with Patsnap Eureka AI!

Autonomous obstacle avoidance system and method for UAV based on deep reinforcement learning

A technology of reinforcement learning and unmanned aerial vehicles, applied in mechanical equipment, combustion engines, internal combustion piston engines, etc., to achieve the effects of high training performance, low time consumption, and improved decision-making timeliness

Active Publication Date: 2022-06-03
中国人民解放军陆军指挥学院
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although these works can scale the trained networks to the real world, they still require a lot of computing resources to generate huge datasets and train them

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Autonomous obstacle avoidance system and method for UAV based on deep reinforcement learning
  • Autonomous obstacle avoidance system and method for UAV based on deep reinforcement learning
  • Autonomous obstacle avoidance system and method for UAV based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] In this embodiment, the discount coefficient is set to 0.95, the learning rate is 0.0001, and the original image size is 84*84. The instantaneous reward function is defined as , where is the time per training loop, set to 0.5 seconds. The reward is designed to make the robot run as fast as possible, and it is punished by simply spinning in place. If a collision is detected, the training episode is terminated immediately with a penalty of -10. Otherwise, the episode will continue to the maximum number of steps. The maximum number of steps is set to 500, and there is no penalty at this time. In this embodiment, the original 10,000 pictures are learned.

[0035] This embodiment is an autonomous obstacle avoidance method for UAVs based on deep reinforcement learning. For the obstacle avoidance process, please refer to figure 1 , which includes the following steps:

[0036] S1. Obtain the original RGB image collected by the UAV monocular camera.

[0037] S2. Using a ful...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a UAV autonomous obstacle avoidance system and method based on deep reinforcement learning. Through a novel system architecture, training and decision-making are separated, which can greatly reduce training time consumption and improve the timeliness of aircraft decision-making; the autonomous obstacle avoidance method adopts a method based on The deep reinforcement learning model of strategy iteration uses the original RGB image taken by the drone monocular camera as the training data, without the need for complex point clouds and other 3D information, and obtains the depth by training the original RGB image through a fully convolutional neural network Image information, and then analyze and predict the image through the reinforcement learning method based on strategy iteration, predict the flight action of the drone at the next moment in advance, and realize autonomous obstacle avoidance. The training time consumption of the obstacle avoidance method proposed by the present invention is more efficient and less time-consuming than the existing typical value-based iteration method, and can realize flexible and autonomous obstacle avoidance, and is suitable for high-speed inspections such as substation automatic inspections and drone cruises. Required autonomous obstacle avoidance scenarios.

Description

technical field [0001] The invention relates to an unmanned aerial vehicle obstacle avoidance system and method, in particular to an unmanned aerial vehicle autonomous obstacle avoidance system and method based on deep reinforcement learning, and belongs to the technical field of unmanned aerial vehicle flight control. Background technique [0002] Obstacle avoidance is one of the core problems of UAVs, and its goal is to allow UAVs to autonomously explore unknown environments to avoid collisions with other objects in order to obtain a flight path that can avoid threats and reach the target safely. Traditional obstacle avoidance technology detects traversable spaces and obstacles, and then performs path planning. The data information used is captured by RGB-D cameras, light detection, ranging sensors (LIDAR), and even sonar. These traditional obstacle avoidance technologies can be well applied to the autonomous obstacle avoidance of ground robots, but it is difficult to appl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/10
CPCY02T10/40
Inventor 王钦辉陈志龙魏军儒何昌其王云宪焦萍闫茜茜
Owner 中国人民解放军陆军指挥学院
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More