A caching strategy method in d2d network based on deep reinforcement learning

A reinforcement learning and network caching technology, applied in neural learning methods, biological neural network models, electrical components, etc., can solve the problems of high energy consumption, long delay, and low hit rate of cache content placement, and achieve low energy consumption and delay short-term effect

Active Publication Date: 2019-09-03
NORTHWESTERN POLYTECHNICAL UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The purpose of the present invention is to provide a D2D network caching strategy method based on deep reinforcement learning, which solves the problems of low hit rate of cache content placement and high energy consumption and long delay in the cache delivery process in the cache-enabled D2D network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A caching strategy method in d2d network based on deep reinforcement learning
  • A caching strategy method in d2d network based on deep reinforcement learning
  • A caching strategy method in d2d network based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0098] In this embodiment, a caching-enabled D2D network with 200 D2D users is considered, and selected content is distributed to D2D storage based on content popularity and user mobility prediction results. In order to simplify the simulation, in the deep reinforcement learning environment, the number of D2D users who satisfy user requests at each moment is set to a fixed value of 4, the distance d∈(0,4), the gain g∈(0,4), and P=1. In practical applications, this variation varies with time, but does not affect the accuracy of the algorithm.

[0099] Such as figure 1 Shown is the convergence performance graph of the present invention based on the deep reinforcement learning algorithm at different learning rates. It can be seen from the graph that the reward value of the system gradually tends to a stable value as time increases. Under the same training environment, the smaller the learning rate, the better the network performance of the system. When the learning rate is 0.01...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a D2D network caching strategy method based on deep reinforcement learning, which uses the historical location information of each user in the cache-enabled D2D network as input data, and obtains the next location information of each user through the echo state network algorithm. Time location information; according to the location information of each user at the next moment, combined with the context information of each user at the current moment, through the echo state network algorithm, the content request information of each user at the next moment is obtained; the content request information It is cached in the cache space of the corresponding user; through the deep reinforcement learning algorithm, the transmission power of the user who transmits the content request information is the smallest and the delay of the user receiving the content request information is the shortest, and the distance between each user in the cache-enabled D2D network is obtained. An optimal strategy for transferring content request information between users; the present invention solves the problems of low hit rate of cache content placement and high energy consumption and long delay in the process of cache delivery in a cache-enabled D2D network.

Description

[0001] 【Technical field】 [0002] The invention belongs to the technical field of cache-enabled D2D network cache transmission, and in particular relates to a cache strategy method in a D2D network based on deep reinforcement learning. [0003] 【Background technique】 [0004] In recent years, device-to-device (D2D) communication has attracted widespread attention in 5G wireless networks. This technology enables users to achieve direct communication within a certain distance without the assistance of base stations, and can effectively improve energy efficiency and Spectral efficiency. [0005] However, as the number of wireless device users increases exponentially, resulting in high traffic loads, this greatly increases backhaul link costs and transmission delays. The caching technology can eliminate repeated data transmission of popular content, reduce backhaul traffic and improve network throughput, and has become a strong candidate for 5G development. [0006] Considering t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04L29/08G06N3/08
CPCG06N3/08H04L67/5682H04L67/568
Inventor 李立欣徐洋李旭高昂梁微殷家应
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products