Resource allocation and power control combined optimization method based on reinforcement learning in heterogeneous network

A technology of reinforcement learning and joint optimization, applied in electrical components, wireless communication, etc., can solve the problems of difficulty in obtaining the global optimal strategy, difficulty in obtaining information, etc., and achieve the effect of maximizing the utility of the system

Active Publication Date: 2018-09-11
CHINA CONSTR THIRD BUREAU FIRST CONSTR & INSTALLATION CO LTD
View PDF3 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing studies mostly consider the above optimization problem separately
At the same time, due to the non-convex and combinatorial nature of joint optimization problems, it is very difficult to obtain a globally optimal strategy
Existing literature proposes optimization methods such as game theory, linear programming, and Markov approximation, but most of these optimization methods require almost all network information, however, in general, such information is difficult to obtain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Resource allocation and power control combined optimization method based on reinforcement learning in heterogeneous network
  • Resource allocation and power control combined optimization method based on reinforcement learning in heterogeneous network
  • Resource allocation and power control combined optimization method based on reinforcement learning in heterogeneous network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the examples. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0062] The specific process is as follows: the joint optimization problem can be formally defined as MDP(S; A i ; i ;P), where S is a set of discrete environmental states, A 1 ,...,A N is a set of discrete possible actions, R 1 ,...,R N is the reward function, and P is the state transition matrix. Firstly, the basic model of reinforcement learning is described, and then, a joint optimization algorithm based on multi-agent reinforcement learning is proposed.

[0063] A. Basic model

[0064] Basic reinforcement learning elements related to defining state space, action space an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of wireless communication, in particular to a resource distribution and power control combined optimization method based on reinforcement learning in a heterogeneous network. The method aims at dynamic and time-varying characteristics of factors such as transmission channels and transmission power, and an optimal resource distribution, user associationand power control combined strategy is obtained by utilizing a distributed Q learning method through establishing a multi-agent enhanced learning framework in combination with the conditions of user satisfaction degree and operator benefit seeking on the basis of establishing a heterogeneous cellular network system model on the premise that the user selfishness and the operator benefit in the heterogeneous network are considered, so that the maximization of the long-term system utility of the whole network is achieved.

Description

technical field [0001] The invention belongs to the technical field of wireless communication, and in particular relates to a joint optimization method for resource allocation and power control based on reinforcement learning in a heterogeneous network. Background technique [0002] With the rapid development of wireless devices and the continuous increase of people's demand for wireless services, the cellular network is facing the huge challenge of increasing network capacity requirements. By deploying multiple femto base stations (Femto Base Station, FBS) with different transmission power and coverage in the macro base station (Macro Base Station, MBS), not only can reduce the communication load and the cost of the operator, but also can use the same wireless frequency to improve the utilization of wireless spectrum. Therefore, heterogeneous cellular networks are expected to improve the system capacity and resource utilization of next-generation cellular networks. [000...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W72/04H04W72/08
CPCH04W72/0473H04W72/543H04W72/23H04W72/53H04W72/541
Inventor 赵楠贺潇范孟林田超樊鹏飞裴一扬武明虎蒋云昊李利荣常春
Owner CHINA CONSTR THIRD BUREAU FIRST CONSTR & INSTALLATION CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products