Vehicle path planning method based on reinforcement learning

A technology of vehicle routing and reinforcement learning, applied in the field of intelligent transportation, to achieve the effect of improving transportation efficiency and reducing costs

Pending Publication Date: 2020-07-14
DALIAN MARITIME UNIVERSITY
View PDF3 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] According to the technical problem that the existing method proposed above does not match the actual situation that the traffic situation changes in each time period and the number of distribution nodes is uncertain, a vehicle path planning method based on reinforcement learning is provided.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vehicle path planning method based on reinforcement learning
  • Vehicle path planning method based on reinforcement learning
  • Vehicle path planning method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to enable those skilled in the art to better understand the solutions of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is an embodiment of a part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present invention.

[0028] It should be noted that the terms "first" and "second" in the description and claims of the present invention and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a vehicle path planning method based on reinforcement learning, which comprises the following steps: taking a state sequence of a client node as input information, sending the input information to a decision network, and selecting an action by the decision network according to an action value function and calculating and planning a vehicle traveling path. According to the method, the model is trained through the reinforcement learning algorithm based on historical distribution data, and therefore the purpose of dynamically planning the driving path under the condition that the road traffic condition and the number of distribution target nodes are changed is achieved. According to the method, complex and changeable road traffic conditions and delivery tasks with uncertain delivery target numbers in real life are considered, and the driving route is dynamically adjusted, so that the transportation efficiency is improved, and the cost is reduced.

Description

technical field [0001] The present invention relates to the field of intelligent transportation, in particular, to a vehicle path planning method based on reinforcement learning. Background technique [0002] Effective planning of vehicle routes is an important link in logistics management, public transportation and taxi passenger transportation, and operations in related fields, which helps to improve transportation efficiency and reduce costs. [0003] Oriol Vinyals et al. (Vinyals O, Fortunato M, Jaitly N. Pointer networks[C] Advances in Neural Information Processing Systems.2015:2692-2700.) proposed a simple but effective architecture called Pointer Net to learn combinatorial optimization problems , the model first uses machine learning methods to solve combinatorial optimization problems. Based on sequence-to-sequence and Neural Turing Machines, it uses a neural attention mechanism to solve the problem of variable-size output dictionaries, and uses supervised learning t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/04G06Q10/08G06Q50/26G06N3/04G06N3/08G06N20/00
CPCG06Q10/047G06Q10/08355G06Q10/0838G06Q50/26G06N3/08G06N20/00G06N3/045
Inventor 高健蒋佳浩
Owner DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products