Control method and device based on reinforcement learning, and controlled motion equipment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of sports equipment and control methods, applied in the direction of adaptive control, general control system, control/adjustment system, etc., can solve the problem that it is difficult to accurately control fighters to autonomously maneuver to the target area, and achieve the effect of reducing the difficulty of training

Inactive Publication Date: 2021-01-22

PLA PEOPLES LIBERATION ARMY OF CHINA STRATEGIC SUPPORT FORCE AEROSPACE ENG UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] In view of this, the purpose of the present invention is to provide a control method, device and controlled motion equipment based on reinforcement learning, which is used to solve the problem of accurately controlling the fighter to maneuver autonomously to the target area in the existing scheme based on DRL. Problems in the target area

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0056]The method of controlling fighter planes based on the deep reinforcement learning model provided in the prior art, however, in the process of training the deep reinforcement learning model, due to the large activity space of the fighter plane, and for fixed-wing aircraft, its movement in all directions The exploration capabilities are also uneven, resulting in the fighter jets having exploration blind spots in the activity space. Exploration blind spots refer to spaces that fighter jets cannot enter. If the exploration blind zone is selected as the center of the attack area during the training process, then the deep reinforcement learning model cannot output the result of controlling the fighter into the exploration blind zone, and the interactive data generated by using the exploration blind zone to train the deep reinforcement learning model will be put into the experience playback After pooling, the data in the experience playback pool will be polluted, which will dire...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a control method and device based on reinforcement learning, and controlled motion equipment. The method comprises the following steps: obtaining the situation information of the controlled motion equipment, and judging whether a deviation angle is greater than an angle threshold value or not; and if it is judged that the deviation angle is larger than an angle threshold value, inputting the situation information into an angle capture network obtained through pre-training, outputting a first action of controlled motion equipment through the angle capture network so as tocontrol the controlled motion equipment to enter a conical space, and controlling the controlled motion equipment to approach the target through the distance capture network in the conical space. Theinteraction space of the distance capture network is the conical space, and then only the coordinates in the conical space are used as the training samples to train the distance capture network, so that the training difficulty is reduced, and the fighter aircraft is quickly and accurately controlled to autonomously maneuver to the attack area.

Description

technical field [0001] The invention belongs to the technical field of automatic control, and in particular relates to a control method, device and controlled motion equipment based on reinforcement learning. Background technique [0002] In the DRL (Deep Reinforcement Learning, Deep Reinforcement Learning)-based control method of the controlled object equipment (such as a fighter plane), the autonomous maneuvering of the fighter plane to the target is a prerequisite for the fighter plane to effectively strike the target. [0003] However, the interaction space of fighter jets is large, and it is difficult to accurately control the fighter jets to autonomously maneuver to the target area when controlling the fighter jets to maneuver to the target based on DRL in the prior art. Contents of the invention [0004] In view of this, the object of the present invention is to provide a control method, device and controlled motion equipment based on reinforcement learning, which i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G05B13/04

CPCG05B13/042

Inventor 杨海涛潘耀宗何宇强杰姜海洋谢海平张健谢凯利袁春慧刘翔宇张裕

Owner PLA PEOPLES LIBERATION ARMY OF CHINA STRATEGIC SUPPORT FORCE AEROSPACE ENG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Control method and device based on reinforcement learning, and controlled motion equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology