Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Solving Power Allocation Algorithm in Cognitive Radio Based on Reinforcement Learning

A cognitive radio and reinforcement learning technology, applied in the field of power allocation strategy, can solve the problem of incomplete channel information and can not power allocation, and achieve the effect of effectively adjusting the transmission power

Active Publication Date: 2021-12-24
NORTHWESTERN POLYTECHNICAL UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a power allocation algorithm based on reinforcement learning to solve cognitive radio, so as to solve the problem that power allocation cannot be performed well under the premise of incomplete channel information in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Solving Power Allocation Algorithm in Cognitive Radio Based on Reinforcement Learning
  • Solving Power Allocation Algorithm in Cognitive Radio Based on Reinforcement Learning
  • Solving Power Allocation Algorithm in Cognitive Radio Based on Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0040] 1. Simulation conditions: 1) The number of CUs is K=6, 2) The transmission power of the PU is P PU = 15dB, 3) the discount factor is β = 0.9, 4) the learning rate of the participants is η a = 0.01, 5) The critic's learning rate is η c = 0.001.

[0041] 2. Simulation content: simulate and compare the relationship between the spectral efficiency (Spectralefficiency, SE) performance of CUs and the time index under different learning algorithm scenarios. The results are as follows: figure 2 . figure 2 Among them, the vertical axis is "spectrum utilization rate of cognitive users"; the horizontal axis is "simulation iteration time".

[0042] Depend on figure 2 Simulation results show that by using Q-learning, continuous-valued states and actions must be quantized, and actual values ​​are replaced by finite discrete-valued approximations. Contrary to our AC-RL algorithm, the Q-learning based power allocation algorithm needs to know the immediate CSI of CUs. Figure 2 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a power allocation algorithm based on reinforcement learning to solve cognitive radio, S1, setting the initial value parameters of the deep learning algorithm, S2, setting the scene model of the CR-NOMA system, and setting the initial state and action State collection; S3, when a certain calculation moment t is less than or equal to the maximum limit time value T max When , obtain the state value at time t and calculate the corresponding reward function, and calculate the TD error δ t ; S4. Select the user's next action based on the value function, and use the learning rate and the TD error value function to update the initial value function to Q(s t ,a t )←Q(s t ,a t )+η c δ t ; Then get the corresponding reward according to the selected execution action, and obtain the policy function π(g), and then update it to π(s t ,a t )←π(s t ,a t )‑η a δ t ;π(g); S5. According to step S3, the TD error value is minimized, iteratively updated, and finally the maximum reward function value is obtained, that is, the allocation algorithm ends. It solves the problem in the prior art that power allocation cannot be well performed under the premise of incomplete channel information.

Description

technical field [0001] The invention belongs to the technical field of communication, and in particular relates to a power allocation strategy, which can be used to solve the power allocation problem in an underlay cognitive radio network. Background technique [0002] Overlay cognitive radio networks can solve the problem of spectrum scarcity, that is, under the constraint that the interference caused by cognitive users cannot degrade the service quality of primary users, cognitive users can use the same spectrum to transmit simultaneously with primary users. On the other hand, Non-orthogonal Multiple Access (NOMA), as a potential technical challenge to improve the spectrum efficiency of future wireless networks, has fundamentally changed the design of conventional access technologies . Power domain non-orthogonal multiple access technology (Power-domain NOMA) is one of the most popular technologies in NOMA technology. Its core idea is to explore the power domain differenc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04B17/382H04W52/34
CPCH04B17/382H04W52/34
Inventor 梁微温书慧杨思远王大伟高昂李立欣
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products