Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Satellite Derotation Method Based on Deep Reinforcement Learning

A reinforcement learning, satellite technology, applied in the field of satellite racemization, to achieve the effect of improving accuracy

Active Publication Date: 2022-05-31
NANJING UNIV OF POSTS & TELECOMM
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Q-learning makes it possible to find the optimal action strategy without the knowledge of immediate reward function and state transition function. In other words, Q-learning makes reinforcement learning no longer dependent on the problem model, but still needs to know the final reward or goal state

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Satellite Derotation Method Based on Deep Reinforcement Learning
  • A Satellite Derotation Method Based on Deep Reinforcement Learning
  • A Satellite Derotation Method Based on Deep Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0037] A satellite derotation method based on deep reinforcement learning, comprising the following steps:

[0038] S1. Marking the data samples of known satellites to establish a sample data set of known satellites;

[0039] S2. Using the fully convolutional neural network to train the sample data set, so that the terminal can understand and identify known satellites in the image or video, and obtain a confidence map of key points of known satellites in the image or video;

[0040] S3, tracking the motion trajectory of the key points in the video, and estimating the pose of the known satellite through the PNP algorithm;

[0041] S4. Use the DDPG algorithm to train the optimal derotation, and use the derotation brush equipped with the space manipulator to brush the side of the spacecraft sail ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a satellite derotation method based on deep reinforcement learning, which is characterized in that it comprises the following steps: marking data samples of known satellites to establish a sample data set of known satellites; using a fully convolutional neural network to train the sample data set , so that the terminal can understand and identify the known satellites in the image or video, and obtain the confidence map of the key points of the known satellites in the image or video; track the trajectory of the key points in the video, and estimate the position of the known satellite through the PNP algorithm Attitude; the optimal derotation is trained by the DDPG algorithm, and the derotation of the space manipulator brushes the side of the spacecraft sail to complete the satellite derotation. The method of the present invention realizes the derotation of the out-of-control satellite with high-speed spin by means of deep reinforcement learning, and at the same time combines the visual information to allow the computer to contact the data and model environment, train the optimal grasping pose, and improve the accuracy of the target capture of the space manipulator Spend.

Description

technical field [0001] The invention relates to a satellite derotation method based on deep reinforcement learning, and belongs to the technical field of satellite derotation methods. Background technique [0002] With the increase in the number of spacecraft in orbit and their wide application, real life is increasingly inseparable from the various application functions provided by spacecraft in orbit. Due to the limitation of the space on-orbit working organization's own conditions and the influence of the space environment, without any supply and maintenance, the operation is often forced to stop due to limited fuel, outdated equipment or module failure, and a new system has to be remanufactured and launched to replace it. Replacement, resulting in unnecessary losses and waste. GEO is the geosynchronous orbit. Carrying out GEO on-orbit maintenance and service and research on related technologies can effectively prolong the service life of the on-orbit system, and at the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): B64G1/10B64G1/24G06N3/04G06N3/08
CPCB64G1/10B64G1/24G06N3/08G06N3/045B64G1/245
Inventor 高浩李芳琳胡海东
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products