Shift strategy dynamic-optimization method based on deep reinforcement learning

A shift strategy and reinforcement learning technology, applied in motor vehicles, non-electric variable control, two-dimensional position/channel control, etc., can solve problems such as low intelligence and versatility, inability to dynamically optimize, and poor adaptive ability. , to achieve the effect of strong versatility, solving the Bellman latitude disaster, and strong adaptive ability

Active Publication Date: 2020-01-21
NANJING UNIV OF SCI & TECH
View PDF10 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Its disadvantage is that when dynamic programming solves the shifting law, it needs to build a complex state diagram, which is expressed in the form of a table
[0008] In general, most of the existing shift strategy solving or optimization methods cannot be dynamically optimized for the actual driving conditions, and the adaptive ability is poor
Some shift strategies that can be dynamically optimized need to manually formulate dynamic update rules for the shift strategy, which is less intelligent and versatile

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Shift strategy dynamic-optimization method based on deep reinforcement learning
  • Shift strategy dynamic-optimization method based on deep reinforcement learning
  • Shift strategy dynamic-optimization method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0084] The present invention provides a dynamic optimization method of shift strategy based on deep reinforcement learning. The invention constructs the Markov decision process of the gear shifting strategy, and then uses the deep reinforcement learning method to solve the gear shifting strategy. After the solution is completed, the prediction Q network trained by deep reinforcement learning is put into the shift strategy controller to realize the gear selection. Then, during the driving process, the prediction Q network is updated by collecting construction machinery and vehicle driving data to realize the dynamic optimization of the shift strategy. The update method of the predicted Q-network includes: updating the predicted Q-network by reconstructing the shifting strategy transfer function according to the construction machinery and vehicle driving data, and directly updating the predicted Q-network according to the deep reinforcement learning method. The principle of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the fields of engineering machinery and vehicle engineering, and particularly relates to a shift strategy dynamic-optimization method based on deep reinforcement learning. Theshift strategy dynamic-optimization method based on deep reinforcement learning comprises the following steps that (1), the state input variable and action output variable of a shift strategy are determined; (2), the Markov decision process of the shift strategy is determined according to the state input variable and the action output variable; (3), according to the shift strategy objective, a reward function of the reinforcement learning shift strategy is established; (4), the deep reinforcement learning shift strategy is solved according to the Markov decision process and the reward function; (5) a prediction Q network calculated in the step (4) is put in a shift strategy controller, during the driving process of engineering machinery and vehicles, the engineering machinery and the vehicles select gears according to the shift strategy controller; and (6), the prediction Q network is regularly updated in the driving process. The shift strategy is updated through a deep reinforcementlearning method to realize the dynamic optimization of the shift strategy.

Description

technical field [0001] The invention belongs to the field of construction machinery and vehicle engineering, and in particular relates to a dynamic optimization method for a shift strategy based on deep reinforcement learning. Background technique [0002] Gear shift strategy is one of the core technologies of construction machinery and vehicle control technology at present, which refers to the law that the gear position changes with the selected parameters during the driving process of construction machinery and vehicles. The solution method is the key consideration in establishing the shift strategy. The solving methods of shift strategy include graphical method, analytical method, genetic algorithm, dynamic programming method and so on. The solution and optimization of the shift strategy is the core direction of the research on the shift strategy, especially the dynamic optimization of the shift strategy is one of the difficulties in the research of the shift strategy. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0223G05D1/0221G05D2201/0213
Inventor 陈刚袁靖张介顾爱博周楠王和荣苏树华陈守宝王良模王陶
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products