Wireless network resource allocation method based on generative adversarial reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of wireless network resources and allocation method, applied in the field of wireless network resource allocation based on generative confrontation reinforcement learning, to achieve the effect of improving wireless network performance

Active Publication Date: 2020-05-19

ZHEJIANG UNIV

View PDF4 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] However, the traditional method based on action value learning (such as deep Q network) is difficult to cope with the disturbance and the uncertainty of immediate reward in the environment. Therefore, distribution reinforcement learning is introduced. The main change is to directly estimate the action value distribution. Instead of estimating the expectation of the action value as traditional methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0085] On the host with the configuration shown in Table 1, a simulation environment is written in Python language, and three different types of services (calling, video and ultra-reliable and low-latency services) are used as examples for testing. The resources to be allocated are wireless bandwidth, the total bandwidth is 10M, and the allocation granularity is 1M, so there are 36 allocation strategies in total, that is, the number of valid actions is 36. The discount factor γ is set to 0.9, the number of samples N for sampling the overall return distribution is 50, and the initial value of ∈ is 0.9, which decreases by 0.05 every 100 runs of the algorithm, and remains unchanged when it decreases to 0.05. buffer size N B is 10000. The G network has 3 neurons in the input layer, 512 neurons in the first hidden layer, 512 neurons in the second hidden layer, and 1800 neurons in the output layer. The D network has 50 neurons in the input layer, 256 neurons in the first hidden ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a wireless network resource allocation method based on generative adversarial reinforcement learning, and belongs to the field of wireless resource allocation and reinforcementlearning. The method comprises the following steps: initializing a generator network G and a discriminator network D, executing resource allocation, training the weights of the generator network G and the discriminator network D, and finally realizing wireless network resource allocation. Compared with a DQN-based resource allocation method and an average resource allocation method, the resourceallocation strategy obtained by the invention can obtain a higher system return value, i.e., higher spectral efficiency and better user experience.

Description

technical field [0001] The present invention relates to the field of wireless network resource allocation and reinforcement learning, and more particularly, to a wireless network resource allocation method based on generative confrontation reinforcement learning. Background technique [0002] The 5G network will support a large number of diverse business scenarios from vertical industries, such as smart security, high-definition video, telemedicine, smart home, autonomous driving, and augmented reality, etc. These business scenarios usually have different communication requirements, such as augmented reality technology requires more Low latency, autonomous driving technology requires the network to provide higher reliability. However, traditional mobile networks are mainly designed to serve a single mobile broadband service and cannot adapt to the diversified service scenarios of 5G in the future. Building a dedicated physical network for each business scenario will inevita...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): H04W72/04G06N3/08

CPCH04W72/04G06N3/08H04W72/53H04W28/16G06N3/047G06N3/045

Inventor 李荣鹏华郁秀马琳张宏纲

Owner ZHEJIANG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Wireless network resource allocation method based on generative adversarial reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology