Frequency-power united allocation method based on multi-agent reinforcement learning in dynamic frequency spectrum environment

A dynamic spectrum and frequency power technology, applied in the field of cognitive radio, can solve the problems of performance discount and inability to converge in RL learning methods

Active Publication Date: 2012-05-09
PLA UNIV OF SCI & TECH
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when multiple SUs coexist, the performance of the RL learning method will be greatly reduced, or even unable to converge,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Frequency-power united allocation method based on multi-agent reinforcement learning in dynamic frequency spectrum environment
  • Frequency-power united allocation method based on multi-agent reinforcement learning in dynamic frequency spectrum environment
  • Frequency-power united allocation method based on multi-agent reinforcement learning in dynamic frequency spectrum environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0064] The frequency power connection and distribution method based on multi-agent learning provided by the present invention, the present invention adopts such as figure 1 In the time slot structure of the cognitive network shown, each complete transmission process consists of 3 time slots. The first time slot is used for frequency power combination and decision-making, the second time slot is used for information transmission, and the third time slot is used for ACK reception. In time slot 1, each cognitive link makes a joint frequency decision and power decision based on the perceived state of the primary user, the Q function in the previous transmission process, the I value table, and the strategy estimates of other links. In time slot 2, the cognitive link transmits information, and at the same time calculates the current instantaneous reward val...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a frequency-power united allocation method based on multi-agent reinforcement learning in a dynamic frequency spectrum environment. A time slot structure for realizing the frequency-power united allocation in the dynamic frequency spectrum environment comprises three time slots: a perception decision time slot, a transmission time slot and a confirmation time slot, wherein the perception decision time slot is used for realizing the perception of a main user state and acquiring a frequency-power united decision of the transmission time slot; the transmission time slot is used for realizing information transmission, estimating a link gain at a receiving end, calculating a current return value and metering the size of interference; and the confirmation time slot is used for receiving ACK (Acknowledgement Character) signals, the link gain, the current return value and feedback information of the size of the interference, and updating a behavior return function needed by the multi-agent reinforcement learning, a historical interference table and estimated values of frequency policies of other cognitive links. According to the method provided by the invention, the real-time frequency-power united allocation with excellent performance can be realized in the dynamic frequency spectrum environment.

Description

technical field [0001] The invention relates to the field of cognitive radio in the wireless communication technology, in particular to a new method for realizing joint allocation of frequency power in a dynamic spectrum environment based on multi-agent reinforcement learning. Background technique [0002] At present, with the rapid growth of types of wireless communication services, the demand for wireless spectrum resources is also increasing exponentially, making the problem of "lack" of spectrum resources in future wireless communications increasingly prominent. Cognitive radio technology utilizes the idle frequency band of licensed users in the way of "opportunistic access" under the condition of ensuring the quality of service of primary users, which greatly improves the efficiency of spectrum use and is an effective method to solve the problem of "spectrum scarcity". The frequency and power joint allocation technology in cognitive radio networks (CRN for short) reduce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04W16/14H04W72/04
Inventor 王金龙吴启晖刘鑫郑学强
Owner PLA UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products