Wireless network resource allocation method based on generative adversarial reinforcement learning

A technology of wireless network resources and allocation method, applied in the field of wireless network resource allocation based on generative confrontation reinforcement learning, to achieve the effect of improving wireless network performance

Active Publication Date: 2020-05-19
ZHEJIANG UNIV
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] However, the traditional method based on action value learning (such as deep Q network) is difficult to cope with the disturbance and the uncertainty of immediate reward in the environment. Therefor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Wireless network resource allocation method based on generative adversarial reinforcement learning
  • Wireless network resource allocation method based on generative adversarial reinforcement learning
  • Wireless network resource allocation method based on generative adversarial reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0085] On the host with the configuration shown in Table 1, a simulation environment is written in Python language, and three different types of services (calling, video and ultra-reliable and low-latency services) are used as examples for testing. The resources to be allocated are wireless bandwidth, the total bandwidth is 10M, and the allocation granularity is 1M, so there are 36 allocation strategies in total, that is, the number of valid actions is 36. The discount factor γ is set to 0.9, the number of samples N for sampling the overall return distribution is 50, and the initial value of ∈ is 0.9, which decreases by 0.05 every 100 runs of the algorithm, and remains unchanged when it decreases to 0.05. buffer size N B is 10000. The G network has 3 neurons in the input layer, 512 neurons in the first hidden layer, 512 neurons in the second hidden layer, and 1800 neurons in the output layer. The D network has 50 neurons in the input layer, 256 neurons in the first hidden ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a wireless network resource allocation method based on generative adversarial reinforcement learning, and belongs to the field of wireless resource allocation and reinforcementlearning. The method comprises the following steps: initializing a generator network G and a discriminator network D, executing resource allocation, training the weights of the generator network G and the discriminator network D, and finally realizing wireless network resource allocation. Compared with a DQN-based resource allocation method and an average resource allocation method, the resourceallocation strategy obtained by the invention can obtain a higher system return value, i.e., higher spectral efficiency and better user experience.

Description

technical field [0001] The present invention relates to the field of wireless network resource allocation and reinforcement learning, and more particularly, to a wireless network resource allocation method based on generative confrontation reinforcement learning. Background technique [0002] The 5G network will support a large number of diverse business scenarios from vertical industries, such as smart security, high-definition video, telemedicine, smart home, autonomous driving, and augmented reality, etc. These business scenarios usually have different communication requirements, such as augmented reality technology requires more Low latency, autonomous driving technology requires the network to provide higher reliability. However, traditional mobile networks are mainly designed to serve a single mobile broadband service and cannot adapt to the diversified service scenarios of 5G in the future. Building a dedicated physical network for each business scenario will inevita...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04W72/04G06N3/08
CPCH04W72/04G06N3/08H04W72/53H04W28/16G06N3/047G06N3/045
Inventor 李荣鹏华郁秀马琳张宏纲
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products