Unlock instant, AI-driven research and patent intelligence for your innovation.

Unmanned aerial vehicle autonomous flight control method based on offline reinforcement learning

A technology of flight control and reinforcement learning, applied in vehicle position/route/height control, attitude control, control/adjustment system, etc., can solve the problem of high training cost, achieve low training cost, improve generalization and robustness , the effect of speeding up the training speed

Active Publication Date: 2021-07-13
NANJING UNIV
View PDF10 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Purpose of the invention: Aiming at the problem of high training cost of the reinforcement learning algorithm in the real environment, the present invention provides a UAV autonomous flight control method based on off-line reinforcement learning, which collects UAV flight data artificially, and does not require any connection with reality during the training process. Real-time interaction with the environment, using offline reinforcement learning algorithms, can train a generalized and robust autonomous flight control strategy at a very low cost, which is suitable for complex and changeable real environments

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle autonomous flight control method based on offline reinforcement learning
  • Unmanned aerial vehicle autonomous flight control method based on offline reinforcement learning
  • Unmanned aerial vehicle autonomous flight control method based on offline reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0038] Such as figure 1Shown, the overall framework schematic diagram of the present invention. First, the drone is artificially controlled to fly, the flight status and actions are stored, and the flight data set is generated. Then based on the flight data set, the offline reinforcement learning algorithm is used for offline training, and then the control strategy (feature network and policy network) is tested in the real environment, and the flight data is collected and added to the flight data...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an unmanned aerial vehicle autonomous flight control method based on offline reinforcement learning, and the method comprises the following steps: (1) manually controlling an unmanned aerial vehicle to execute a flight task, collecting the flight data of the unmanned aerial vehicle in a real environment, and generating a data set; (2) designing a reward function according to the flight state and action based on the data set; (3) based on an offline reinforcement learning algorithm, training an autonomous flight control strategy only by using the data set; (4) in a real environment, controlling the unmanned aerial vehicle to execute a flight task by using an autonomous flight control strategy, monitoring and testing the performance of the control strategy by an unmanned aerial vehicle operator in real time, and collecting flight data; (5) adding the collected new flight data into the data set; and (6) iteratively executing the steps (2), (3), (4) and (5) until the autonomous flight control strategy can complete the flight task. According to the method, an autonomous flight control strategy with good generalization and robustness can be trained with low cost, and the method is suitable for a complex and changeable real environment.

Description

technical field [0001] The invention relates to an autonomous flight control method of an unmanned aerial vehicle based on off-line reinforcement learning, and belongs to the technical field of autonomous flight control of an unmanned aerial vehicle. Background technique [0002] Due to its simple structure, flexibility and mobility, UAV has been widely used, and has an irreplaceable position in the fields of aerial photography, surveying and mapping, logistics and navigation. UAV system control technology has developed rapidly in recent years, and autonomous flight technology is one of its core technologies. With the extensive application of UAVs in various scenarios, the flight environment faced by them is more complex and changeable, and the uncertainty is greatly increased. Traditional UAV autonomous flight methods can often achieve satisfactory results in some simple environments, but it is difficult to cope with flight tasks in highly dynamic environments. [0003] I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G05D1/08G05D1/10
CPCG05D1/0808G05D1/101
Inventor 俞扬詹德川周志华高永青秦熔均陈雄辉庞竟成袁雷管聪
Owner NANJING UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More