Dynamic beam scheduling method based on deep reinforcement learning

A technology that enhances learning and scheduling methods, and is applied to radio transmission systems, electrical components, transmission systems, etc., to achieve the effect of reducing transmission waiting delays

Active Publication Date: 2018-12-07
BEIJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 59 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the dynamic beam scheduling problem of a multi-beam satellite communication system is a sequential decision-making problem in a complex environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dynamic beam scheduling method based on deep reinforcement learning
  • Dynamic beam scheduling method based on deep reinforcement learning
  • Dynamic beam scheduling method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0048] like figure 1 , is a schematic diagram of the application scenario of the dynamic beam scheduling method based on deep enhanced learning in the present invention, the satellite provides K beams, and the beam set k is the beam number, there are N cells under the coverage of K beams, and the cell set n is the cell number, and the satellite covers N cells in a time-division multiplexing manner by quickly switching K beams, where K

[0049] In each time slot, all cells request data packets from the satellite, and these data packets will be stored in the satellite buffer, and then K beams are allocated to the corresponding cells through the dynamic beam scheduling method, and the data packets are sent to these cells, and then The remaining pac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a dynamic beam scheduling method based on deep reinforcement learning, which belongs to the field of multi-beam satellite communication systems. The dynamic beam scheduling method comprises the steps of: firstly, modeling a dynamic beam scheduling problem into a Markov decision process, wherein states of each time slot comprise a data matrix, a delay matrix and a channel capacity matrix in a satellite buffer, actions represent a dynamic beam scheduling strategy, and a target is the long-term reduction of accumulated waiting delay of all data packets; and secondly, solving a best action strategy by utilizing a deep reinforcement learning algorithm, establishing a Q network of a CNN+DNN structure, training the Q network, using the trained Q network to make action decisions, and acquiring the best action strategy. According to the dynamic beam scheduling method, a satellite directly outputs a current beam scheduling result according to the environment state at the moment through a large amount of autonomous learning, maximizes the overall performance of the system in the long term, and greatly reduces the transmission waiting delay of the data packets while keeping the system throughput almost unchanged.

Description

technical field [0001] The invention belongs to the field of multi-beam satellite communication systems, and relates to a dynamic beam scheduling method based on deep enhanced learning. Background technique [0002] As a supplement to the basic structure of the ground communication network, the satellite communication system has been studied for its strong global coverage, long communication distance, high system capacity, strong ability to resist major natural disasters, and the ability to provide fixed and mobile communication services. The attention of personnel and the support of the country. With the increasing demand for its capacity and the continuous consumption of spectrum resources, a multi-beam satellite communication system has been proposed. It uses multiple high-gain narrow beams to cover a large area together, which can effectively improve system performance. However, the more spot beams a satellite provides, the more transmitters are needed, and the cost of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W72/04H04W72/12H04B7/185
Inventor 胡欣王艺鹏李秀华王卫东刘帅军张雨晨
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products