Shift strategy dynamic-optimization method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A shift strategy and reinforcement learning technology, applied in motor vehicles, non-electric variable control, two-dimensional position/channel control, etc., can solve problems such as low intelligence and versatility, inability to dynamically optimize, and poor adaptive ability. , to achieve the effect of strong versatility, solving the Bellman latitude disaster, and strong adaptive ability

Active Publication Date: 2020-01-21

NANJING UNIV OF SCI & TECH

View PDF10 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Its disadvantage is that when dynamic programming solves the shifting law, it needs to build a complex state diagram, which is expressed in the form of a table

[0008] In general, most of the existing shift strategy solving or optimization methods cannot be dynamically optimized for the actual driving conditions, and the adaptive ability is poor

Some shift strategies that can be dynamically optimized need to manually formulate dynamic update rules for the shift strategy, which is less intelligent and versatile

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0084] The present invention provides a dynamic optimization method of shift strategy based on deep reinforcement learning. The invention constructs the Markov decision process of the gear shifting strategy, and then uses the deep reinforcement learning method to solve the gear shifting strategy. After the solution is completed, the prediction Q network trained by deep reinforcement learning is put into the shift strategy controller to realize the gear selection. Then, during the driving process, the prediction Q network is updated by collecting construction machinery and vehicle driving data to realize the dynamic optimization of the shift strategy. The update method of the predicted Q-network includes: updating the predicted Q-network by reconstructing the shifting strategy transfer function according to the construction machinery and vehicle driving data, and directly updating the predicted Q-network according to the deep reinforcement learning method. The principle of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the fields of engineering machinery and vehicle engineering, and particularly relates to a shift strategy dynamic-optimization method based on deep reinforcement learning. Theshift strategy dynamic-optimization method based on deep reinforcement learning comprises the following steps that (1), the state input variable and action output variable of a shift strategy are determined; (2), the Markov decision process of the shift strategy is determined according to the state input variable and the action output variable; (3), according to the shift strategy objective, a reward function of the reinforcement learning shift strategy is established; (4), the deep reinforcement learning shift strategy is solved according to the Markov decision process and the reward function; (5) a prediction Q network calculated in the step (4) is put in a shift strategy controller, during the driving process of engineering machinery and vehicles, the engineering machinery and the vehicles select gears according to the shift strategy controller; and (6), the prediction Q network is regularly updated in the driving process. The shift strategy is updated through a deep reinforcementlearning method to realize the dynamic optimization of the shift strategy.

Description

technical field [0001] The invention belongs to the field of construction machinery and vehicle engineering, and in particular relates to a dynamic optimization method for a shift strategy based on deep reinforcement learning. Background technique [0002] Gear shift strategy is one of the core technologies of construction machinery and vehicle control technology at present, which refers to the law that the gear position changes with the selected parameters during the driving process of construction machinery and vehicles. The solution method is the key consideration in establishing the shift strategy. The solving methods of shift strategy include graphical method, analytical method, genetic algorithm, dynamic programming method and so on. The solution and optimization of the shift strategy is the core direction of the research on the shift strategy, especially the dynamic optimization of the shift strategy is one of the difficulties in the research of the shift strategy. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/02

CPCG05D1/0223G05D1/0221

Inventor 陈刚袁靖张介顾爱博周楠王和荣苏树华陈守宝王良模王陶

Owner NANJING UNIV OF SCI & TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Shift strategy dynamic-optimization method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology