Terminal access selection method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and terminal access, which is applied in the field of communication networks to achieve the effects of improving resource utilization, transmission rate, and transmission stability.

Inactive Publication Date: 2020-02-18

UNIV OF ELECTRONICS SCI & TECH OF CHINA

View PDF10 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In order to solve the signal access switching problem caused by the movement of terminals in heterogeneous networks, the present invention proposes a terminal access selection method based on deep reinforcement learning, and implements heterogeneous network access selection based on terminal self-learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0037] In order to facilitate those of ordinary skill in the art to understand the present invention, at first the following technical terms are defined:

[0038] 1. Q-Learning

[0039] A reinforcement learning algorithm, the agent perceives the environment by performing actions in the environment to obtain a certain reward, so as to learn the mapping strategy from state to action to maximize the reward value.

[0040] 2. Deep-Q-Learning (DQN)

[0041]DQN is the first to combine deep learning models with reinforcement learning to successfully learn control policies directly from high-dimensional inputs. By introducing the method of expected delayed return, the MDP (Markov Decision Process) problem under the condition of lack of information is solved. It can be considered that DQN learning is based on the instantaneous strategy and is a special deep reinforcement learning method of an independent model.

[0042] 3. Adaptive

[0043] According to the data characteristics of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a terminal access selection method based on deep reinforcement learning, is applied to the field of communications, and the problem of signal access switching caused by movement of a terminal in a heterogeneous network is solved. According to the invention, various factors such as service quality requirements of different types of businesses and service quality assurance capabilities of different types of access nodes are comprehensively considered; and perception decision making on the current network environment is carried out by adopting a Dep-Q-Learning deep reinforcement learning algorithm, thereby realizing terminal intelligent access selection decision making based on environment and resource perception. The communication experience of the user is effectivelyimproved, and the algorithm has self-adaptability and online learning capability.

Description

technical field [0001] The invention belongs to the field of communication networks, and in particular relates to a terminal switching access technology in a wireless heterogeneous network. Background technique [0002] With the explosive growth of the number of mobile terminals and traffic, a single network can no longer meet performance requirements such as high coverage, low latency, and high bandwidth. Therefore, the new-generation network will integrate existing heterogeneous networks to maximize network performance and efficiency. On the basis of this network, the terminal access strategy is a problem that is expected to be solved. [0003] Due to the diversity of terminal services, terminals have different requirements for signal strength, delay, and signal-to-noise ratio. The terminals that the network provides services include not only smartphones, but also IoT devices such as smart homes and vehicle-mounted smart terminals. The needs of these terminals include vid...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): H04W48/08H04W48/16

CPCH04W48/08H04W48/16

Inventor 黄晓燕成泽坤杨宁冷甦鹏

Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA

Terminal access selection method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology