Satellite racemization method based on deep reinforcement learning

A technology of reinforcement learning and satellite, which is applied in the field of satellite derotation to achieve the effect of improving accuracy

Active Publication Date: 2021-02-09
NANJING UNIV OF POSTS & TELECOMM
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Q-learning makes it possible to find the optimal action strategy without the knowledge of immediate reward function and state transition function. In other words, Q-learning makes reinforcement learning no longer dependent on the problem model, but still needs to know the final reward or goal state

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Satellite racemization method based on deep reinforcement learning
  • Satellite racemization method based on deep reinforcement learning
  • Satellite racemization method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0037] A satellite derotation method based on deep reinforcement learning, comprising the following steps:

[0038] S1. Marking the data samples of known satellites to establish a sample data set of known satellites;

[0039] S2. Using the fully convolutional neural network to train the sample data set, so that the terminal can understand and identify known satellites in the image or video, and obtain a confidence map of key points of known satellites in the image or video;

[0040] S3, tracking the motion trajectory of the key points in the video, and estimating the pose of the known satellite through the PNP algorithm;

[0041] S4. Use the DDPG algorithm to train the optimal derotation, and use the derotation brush equipped with the space manipulator to brush the side of the spacecraft sail ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a satellite racemization method based on deep reinforcement learning, and the method is characterized in that the method comprises the following steps: marking data samples ofa known satellite, and establishing a sample data set of the known satellite; training the sample data set by using a full convolutional neural network, so that the terminal can understand and identify known satellites in the images or videos to obtain confidence maps of key points of the known satellites in the images or videos; tracking the motion trails of the key points in the video, and estimating the pose of a known satellite through a PNP algorithm; training optimal racemization through a DDPG algorithm, and completing satellite racemization by brushing the side edge of the spacecraft sailboard through racemization of the space manipulator. According to the method, by means of deep reinforcement learning, racemization of a high-speed spinning out-of-control satellite is achieved, meanwhile, a computer makes contact with data and a model environment in combination with visual information, the optimal grabbing pose is trained, and the target capturing accuracy of the space mechanical arm is improved.

Description

technical field [0001] The invention relates to a satellite derotation method based on deep reinforcement learning, and belongs to the technical field of satellite derotation methods. Background technique [0002] With the increase in the number of spacecraft in orbit and their wide application, real life is increasingly inseparable from the various application functions provided by spacecraft in orbit. Due to the limitation of the space on-orbit working organization's own conditions and the influence of the space environment, without any supply and maintenance, the operation is often forced to stop due to limited fuel, outdated equipment or module failure, and a new system has to be remanufactured and launched to replace it. Replacement, resulting in unnecessary losses and waste. GEO is the geosynchronous orbit. Carrying out GEO on-orbit maintenance and service and research on related technologies can effectively prolong the service life of the on-orbit system, and at the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): B64G1/10B64G1/24G06N3/04G06N3/08
CPCB64G1/10B64G1/24G06N3/08G06N3/045B64G1/245
Inventor 高浩李芳琳胡海东
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products