Driving strategy model training method and device

A driving strategy and driving equipment technology, applied in computational models, character and pattern recognition, instruments, etc., can solve problems such as large training costs, long time, and vehicle body damage, reduce the number of trials and errors, shorten the training process, The effect of improving efficiency

Active Publication Date: 2018-03-30
UISEE TECH BEIJING LTD
View PDF4 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the development and application of machine learning technology, for example, the development of reinforcement learning technology, in the existing automatic driving technology, for vehicles, especially the driving control of automatic driving vehicles, the reinforcement learning neural network trained by reinforcement learning algorithm To achieve this, the real-time state information of the vehicle is input to the reinforcement learning neural network, thereby outputting the corresponding driving strategy information. However, in the existing training of the reinforcement learning neural network, for each vehicle that nee

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Driving strategy model training method and device
  • Driving strategy model training method and device
  • Driving strategy model training method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The application will be described in further detail below in conjunction with the accompanying drawings.

[0027] In a typical configuration of the present application, a terminal, a device serving a network, and a computing device include one or more processors (CPUs), input / output interfaces, network interfaces, and memory.

[0028] Memory may include non-permanent storage in computer-readable media, in the form of random access memory (RAM) and / or nonvolatile memory, such as read-only memory (ROM) or flash RAM. Memory is an example of computer readable media.

[0029] Computer-readable media, including both permanent and non-permanent, removable and non-removable media, can be implemented by any method or technology for storage of information. Information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access mem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention aims at providing a driving strategy model training method and device. The method comprises acquiring model parameter information corresponding to a driving strategy model of a driving device, wherein the model parameter information is determined by training the driving strategy model on the basis of preset driving rule information, and the driving strategy model is established on the basis on reinforcement learning algorithms; acquiring the driving parameter information of the driving device during a driving process, and based on the model parameter information, training the driving strategy model. Compared with the prior art, the driving strategy model training method avoids exploration from scratch on training the driving strategy model, instead, the driving device can drive just like having learnt driving rules before training starts, so that the training process of the driving strategy model on the basis can be greatly shortened, and meanwhile, the number of times ofunreasonable driving strategies as well as damage to vehicles during the training process can be greatly reduced.

Description

technical field [0001] The present application relates to the field of automatic driving, and in particular to a technology for training a driving strategy model. Background technique [0002] With the development and application of machine learning technology, for example, the development of reinforcement learning technology, in the existing automatic driving technology, for vehicles, especially the driving control of automatic driving vehicles, the reinforcement learning neural network trained by reinforcement learning algorithm To achieve this, the real-time state information of the vehicle is input to the reinforcement learning neural network, thereby outputting the corresponding driving strategy information. However, in the existing training of the reinforcement learning neural network, for each vehicle that needs to be trained, It is necessary to continuously train the corresponding neural network parameters from scratch. However, in practical applications, for differe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N99/00
CPCG06N20/00G06F18/214
Inventor 许稼轩周小成
Owner UISEE TECH BEIJING LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products