Mobile sink path planning method based on deep reinforcement learning algorithm

A technology of reinforcement learning and path planning, applied in the transmission system, network topology, wireless communication, etc., can solve problems such as difficult battery replacement, fast energy consumption, and energy voids, and achieve high network efficiency, good real-time performance, and reduced complexity degree of effect

Inactive Publication Date: 2018-11-30
BEIJING UNIV OF TECH
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In a traditional wireless sensor network, the positions of all nodes are fixed, and most of the sensor nodes are powered by batteries, and it is difficult to replace the batteries once deployed
In WSN, the most typical data collection method is that sensor node data is transmitted to the base station or sink node in a multi-hop manner. Sensor nodes close to the base station or sink carry more forwarding tasks, and energy consumption is faster, resulting in energy holes. and hot spots

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mobile sink path planning method based on deep reinforcement learning algorithm
  • Mobile sink path planning method based on deep reinforcement learning algorithm
  • Mobile sink path planning method based on deep reinforcement learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The present invention will be further described below in conjunction with accompanying drawing.

[0028] The present invention uses a deep reinforcement learning algorithm to plan the mobile sink path in real time, and the process of the depth reinforcement learning algorithm is described below:

[0029] The present invention uses a greedy strategy to select the action of the sink, that is, the action value is randomly generated with a certain probability. By continuously reducing the probability of the action value generated by the greedy strategy and increasing the probability of the action value generated by the policy network, this can prevent the policy network from falling into a local optimum.

[0030] The state of the present invention is an RGB image, a grid divided by the entire wireless sensor network area (such as figure 1 Shown) is mapped according to the data priority, such as figure 2 As shown, the network state complexity is low.

[0031] The action ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a mobile sink path planning method based on a deep reinforcement learning algorithm. A mobile sink path is planned by using a deep reinforcement learning algorithm. A rasterized network state is mapped to an RGB image and the RGB image is input into a deep convolutional neural network, and network parameters are updated constantly through training. In practical application,an actual network state is only input into a trained neural network, so as to obtain an optimal traveling path of sink. The method considers a data delay requirement and network energy consumption ofa wireless sensor network comprehensively. Compared with a traditional wireless sensor network, the method can balance network energy consumption effectively and can improve energy efficiency. Rasterisation of the network state makes network state less complex.

Description

technical field [0001] The invention belongs to the technical field of wireless sensor networks, and in particular relates to a mobile sink path planning method based on a deep reinforcement learning algorithm. Background technique [0002] The wireless sensor network is composed of a large number of sensor nodes deployed in the network area, the aggregation nodes for collecting information, and the management nodes. The sensor nodes communicate with each other in a multi-hop manner, forming a multi-hop self-organizing network. The wireless sensor network can collect, process and transmit data in the area. In the network coverage area, the sensor nodes collect and process the data, and forward them to other sensor nodes or sink nodes. [0003] In a traditional wireless sensor network, the positions of all nodes are fixed, and most of the sensor nodes are powered by batteries, and it is difficult to replace the batteries once they are deployed. In WSN, the most typical data ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W40/04H04W40/32H04W84/18H04L12/24
CPCH04L41/044H04L41/145H04W40/04H04W40/32H04W84/18Y02D30/70
Inventor 司鹏搏刘雯琪张正徐广书郝国超于航张延华
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products