Supercharge Your Innovation With Domain-Expert AI Agents!

Control method and device based on reinforcement learning, and controlled motion equipment

A technology of sports equipment and control methods, applied in the direction of adaptive control, general control system, control/adjustment system, etc., can solve the problem that it is difficult to accurately control fighters to autonomously maneuver to the target area, and achieve the effect of reducing the difficulty of training

Inactive Publication Date: 2021-01-22
PLA PEOPLES LIBERATION ARMY OF CHINA STRATEGIC SUPPORT FORCE AEROSPACE ENG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, the purpose of the present invention is to provide a control method, device and controlled motion equipment based on reinforcement learning, which is used to solve the problem of accurately controlling the fighter to maneuver autonomously to the target area in the existing scheme based on DRL. Problems in the target area

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Control method and device based on reinforcement learning, and controlled motion equipment
  • Control method and device based on reinforcement learning, and controlled motion equipment
  • Control method and device based on reinforcement learning, and controlled motion equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056]The method of controlling fighter planes based on the deep reinforcement learning model provided in the prior art, however, in the process of training the deep reinforcement learning model, due to the large activity space of the fighter plane, and for fixed-wing aircraft, its movement in all directions The exploration capabilities are also uneven, resulting in the fighter jets having exploration blind spots in the activity space. Exploration blind spots refer to spaces that fighter jets cannot enter. If the exploration blind zone is selected as the center of the attack area during the training process, then the deep reinforcement learning model cannot output the result of controlling the fighter into the exploration blind zone, and the interactive data generated by using the exploration blind zone to train the deep reinforcement learning model will be put into the experience playback After pooling, the data in the experience playback pool will be polluted, which will dire...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a control method and device based on reinforcement learning, and controlled motion equipment. The method comprises the following steps: obtaining the situation information of the controlled motion equipment, and judging whether a deviation angle is greater than an angle threshold value or not; and if it is judged that the deviation angle is larger than an angle threshold value, inputting the situation information into an angle capture network obtained through pre-training, outputting a first action of controlled motion equipment through the angle capture network so as tocontrol the controlled motion equipment to enter a conical space, and controlling the controlled motion equipment to approach the target through the distance capture network in the conical space. Theinteraction space of the distance capture network is the conical space, and then only the coordinates in the conical space are used as the training samples to train the distance capture network, so that the training difficulty is reduced, and the fighter aircraft is quickly and accurately controlled to autonomously maneuver to the attack area.

Description

technical field [0001] The invention belongs to the technical field of automatic control, and in particular relates to a control method, device and controlled motion equipment based on reinforcement learning. Background technique [0002] In the DRL (Deep Reinforcement Learning, Deep Reinforcement Learning)-based control method of the controlled object equipment (such as a fighter plane), the autonomous maneuvering of the fighter plane to the target is a prerequisite for the fighter plane to effectively strike the target. [0003] However, the interaction space of fighter jets is large, and it is difficult to accurately control the fighter jets to autonomously maneuver to the target area when controlling the fighter jets to maneuver to the target based on DRL in the prior art. Contents of the invention [0004] In view of this, the object of the present invention is to provide a control method, device and controlled motion equipment based on reinforcement learning, which i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G05B13/04
CPCG05B13/042
Inventor 杨海涛潘耀宗何宇强杰姜海洋谢海平张健谢凯利袁春慧刘翔宇张裕
Owner PLA PEOPLES LIBERATION ARMY OF CHINA STRATEGIC SUPPORT FORCE AEROSPACE ENG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More