Internet of vehicles edge computing task unloading method based on hierarchical reinforcement learning

An edge computing and reinforcement learning technology, applied in neural learning methods, constraint-based CAD, computing, etc., can solve NP-hard problems and achieve the effect of excellent performance and low joint loss function

Pending Publication Date: 2021-10-29
广东利通科技投资有限公司 +1
View PDF0 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the premise of not destroying the subtask execution correlation, how to effectively deal with this type of application is still a big challenge;
[0006] (2) System loss function
[0007] (3) Computational complexity
The task offloading problem in the edge computing of the Internet of Vehicles is often constructed as a nonlinear mixed integer problem, which is NP-hard and cannot be solved in polynomial time [10]
The long solution time is unacceptable for tasks with high real-time requirements, so it is very important to design a task offloading algorithm that does not sacrifice performance and can solve quickly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Internet of vehicles edge computing task unloading method based on hierarchical reinforcement learning
  • Internet of vehicles edge computing task unloading method based on hierarchical reinforcement learning
  • Internet of vehicles edge computing task unloading method based on hierarchical reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0072] Set the parameters of the example

[0073] Simulation environment: Python;

[0074] Simulation platform: such as figure 1 shown;

[0075] Reward discount factor: 0.99;

[0076] Learning rate of graph attention network: 0.001;

[0077] Learning rate for hierarchical action decision network: 0.01.

[0078] The task offloading method for edge computing of the Internet of Vehicles based on layered reinforcement learning, the specific steps are:

[0079] Step 1: Initialize the graph attention network Q g (s, a; θ g ), the hierarchical action decision network Q p (s, a; θ p ) and its target network Q′ p (s, a; θ' p ), where θ′ p = θ p , and initialize the experience playback pool D at the same time.

[0080] Step 2: Observing the current environment state s t , select and execute the hierarchical action a t ={(y t , k t =0, f t )∪(y t ,k t = 1,p t )}.

[0081] Step 3: Observe the next environment state st+1 And get a single step reward r t .

[0082] S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of Internet of Vehicles edge computing, and particularly relates to an Internet of Vehicles edge computing task unloading method based on hierarchical reinforcement learning. Firstly, a task unloading problem in an edge computing network of the Internet of Vehicles is modeled as an optimization problem with a minimum delay-energy consumption-cost joint loss function as a target, wherein optimization parameters are a task execution sequence, a computing decision, local resource allocation and transmission power control; then, the application with task relevance is expressed in the form of a directed acyclic graph, implicit features in the application are mined through a graph neural network, and meanwhile the discrete continuous mixed action space is processed through a layered reinforcement learning algorithm; and a simulation experiment is carried out by taking the automobile speed adopted in a real environment as a data set, and a result shows that compared with a heuristic algorithm, the method disclosed by the invention can adaptively adjust task unloading and resource allocation strategies under various environmental parameters, so that a system loss function is more effectively reduced.

Description

technical field [0001] The invention belongs to the technical field of edge computing of the Internet of Vehicles, and in particular relates to a task offloading method for edge computing of the Internet of Vehicles based on layered reinforcement learning. Background technique [0002] With the continuous development of the Internet of Vehicles, applications such as assisted driving, augmented reality, and image processing have been gradually deployed in vehicle equipment to improve the driving experience of people in the vehicle [1]. These applications usually have two characteristics: large demand for computing resources and high real-time requirements. However, in-vehicle devices have limited computing resources and limited energy supply, which often cannot meet the demands of these applications [2]. [0003] By introducing cloud servers with powerful computing capabilities, mobile cloud computing (Mobile Cloud Computing, MCC) is regarded as an effective way to solve the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/445G06F9/50G06F30/15G06F30/27G06N3/04G06N3/08G06F111/04G06F111/06
CPCG06F9/44594G06F9/5038G06F9/5072G06F30/15G06F30/27G06N3/04G06N3/08G06F2209/509G06F2111/04G06F2111/06Y02D10/00
Inventor 徐跃东游新宇戴连贵邢万勇
Owner 广东利通科技投资有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products