Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Heterogeneous Internet of Vehicles user association method based on multi-agent deep reinforcement learning

A reinforcement learning, multi-agent technology, applied in the field of wireless communication, can solve problems such as difficulty in implementation and large computing dimension, and achieve the effect of saving communication resources, reducing computing dimension, and improving computing efficiency

Pending Publication Date: 2022-05-06
NANJING UNIV OF SCI & TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the above-mentioned single-agent method requires a large amount of or almost complete information for central decision-making, which not only makes it difficult to realize due to the large computational dimension, but also usually generates a lot of unnecessary communication overhead, so further research is needed on how to use less information and resources for better performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Heterogeneous Internet of Vehicles user association method based on multi-agent deep reinforcement learning
  • Heterogeneous Internet of Vehicles user association method based on multi-agent deep reinforcement learning
  • Heterogeneous Internet of Vehicles user association method based on multi-agent deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0058] This embodiment sets that when all vehicle users reach the end of the road, they complete a round. Each movement and association action made by the vehicle user in the round is called a time slot.

[0059] combination Figure 1 , this embodiment is a heterogeneous Internet of vehicles user association method based on multi-agent deep reinforcement learning. The specific steps are as follows:

[0060] Step 1: initialize relevant parameters of the algorithm.

[0061] Initialize the relevant weight, offset and other parameters of the vehicle user's local online Q network and target Q network, as well as the hidden state parameters of the cyclic neural network layer. Both local online Q network and target Q network have two linear network layers and one gate recurrent unit (Gru) layer.

[0062] Step 2: each vehicle user obtains the local status information by observing the current environment, and then inputs it into the local network to obtain the corresponding Q value, and ε- ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a heterogeneous Internet of Vehicles user association method based on multi-agent deep reinforcement learning, and the method comprises the steps: firstly modeling a problem into a partially observable Markov decision process, and then employing the idea of decomposing a team value function, and specifically comprises the steps: building a centralized training distributed execution framework, the team value function is connected with each user value function through summation so as to achieve the purpose of implicit training of the user value functions; and then, referring to experience playback and a target network mechanism, performing action exploration and selection by using an epsilon-greedy strategy, storing historical information by using a recurrent neural network, selecting a Huber loss function to calculate loss and perform gradient descent at the same time, and finally learning the association strategy of the heterogeneous Internet of Vehicles users. Compared with a multi-agent independent deep Q learning algorithm and other traditional algorithms, the method provided by the invention can more effectively improve the energy efficiency and reduce the switching overhead at the same time in a heterogeneous Internet of Vehicles environment.

Description

technical field [0001] The invention relates to the field of wireless communication technology, in particular to a user association method of heterogeneous vehicle networking based on multi-agent deep reinforcement learning. Background technology [0002] With the rapid development of economy in recent years, the number of cars in the world is increasing day by day. While cars bring convenience to people, the probability of traffic congestion and traffic accidents is also greatly increased. Therefore, vehicular ad hoc networks (VANETs) are born. VANETs uses advanced wireless communication technology and perception technology to connect vehicles, pedestrians and roads within a certain range, comprehensively perceive traffic and roads, and form a special mobile ad hoc network (Cao s, Lee V c.an accurate and complete performance modeling of the ieee802.11p MAC sublayer for VANET [J]. Computer communications, 2020149:107-120). [0003] Due to the high mobility of the Internet of vehi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04W4/44H04W4/46H04W4/02H04W4/021G06N3/04G06N3/08
CPCH04W4/44H04W4/46H04W4/025H04W4/023H04W4/021G06N3/08G06N3/045Y02D30/70
Inventor 陶奕宇林艳包金鸣张一晋邹骏李骏束锋
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products