Resource allocation and power control combined optimization method based on reinforcement learning in heterogeneous network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and joint optimization, applied in electrical components, wireless communication, etc., can solve the problems of difficulty in obtaining the global optimal strategy, difficulty in obtaining information, etc., and achieve the effect of maximizing the utility of the system

Active Publication Date: 2018-09-11

CHINA CONSTR THIRD BUREAU FIRST CONSTR & INSTALLATION CO LTD

View PDF3 Cites 32 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Existing studies mostly consider the above optimization problem separately

At the same time, due to the non-convex and combinatorial nature of joint optimization problems, it is very difficult to obtain a globally optimal strategy

Existing literature proposes optimization methods such as game theory, linear programming, and Markov approximation, but most of these optimization methods require almost all network information, however, in general, such information is difficult to obtain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0061] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the examples. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0062] The specific process is as follows: the joint optimization problem can be formally defined as MDP(S; A i ; i ;P), where S is a set of discrete environmental states, A 1 ,...,A N is a set of discrete possible actions, R 1 ,...,R N is the reward function, and P is the state transition matrix. Firstly, the basic model of reinforcement learning is described, and then, a joint optimization algorithm based on multi-agent reinforcement learning is proposed.

[0063] A. Basic model

[0064] Basic reinforcement learning elements related to defining state space, action space an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the technical field of wireless communication, in particular to a resource distribution and power control combined optimization method based on reinforcement learning in a heterogeneous network. The method aims at dynamic and time-varying characteristics of factors such as transmission channels and transmission power, and an optimal resource distribution, user associationand power control combined strategy is obtained by utilizing a distributed Q learning method through establishing a multi-agent enhanced learning framework in combination with the conditions of user satisfaction degree and operator benefit seeking on the basis of establishing a heterogeneous cellular network system model on the premise that the user selfishness and the operator benefit in the heterogeneous network are considered, so that the maximization of the long-term system utility of the whole network is achieved.

Description

technical field [0001] The invention belongs to the technical field of wireless communication, and in particular relates to a joint optimization method for resource allocation and power control based on reinforcement learning in a heterogeneous network. Background technique [0002] With the rapid development of wireless devices and the continuous increase of people's demand for wireless services, the cellular network is facing the huge challenge of increasing network capacity requirements. By deploying multiple femto base stations (Femto Base Station, FBS) with different transmission power and coverage in the macro base station (Macro Base Station, MBS), not only can reduce the communication load and the cost of the operator, but also can use the same wireless frequency to improve the utilization of wireless spectrum. Therefore, heterogeneous cellular networks are expected to improve the system capacity and resource utilization of next-generation cellular networks. [000...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): H04W72/04H04W72/08H04W72/54

CPCH04W72/0473H04W72/543H04W72/23H04W72/53H04W72/541

Inventor 赵楠贺潇范孟林田超樊鹏飞裴一扬武明虎蒋云昊李利荣常春

Owner CHINA CONSTR THIRD BUREAU FIRST CONSTR & INSTALLATION CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Resource allocation and power control combined optimization method based on reinforcement learning in heterogeneous network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology