Reinforcement learning unit matching recurrent neural network system and training and prediction method thereof

A recurrent neural network and reinforcement learning technology, applied in the field of reinforcement learning unit matching recurrent neural network system and its training and prediction, can solve the problem of unsatisfactory model prediction accuracy and computational efficiency, loss of sample validity and diversity, and inconsistent prediction results. Determine and other problems to avoid blind search, enhance generalization ability, and improve convergence speed

Active Publication Date: 2019-05-14
SICHUAN UNIV
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For PF, the resampling stage will cause loss of sample validity and diversity, leading to sample impoverishment phenomenon
Since the kernel function type and kernel parameters of SVR are still difficult to set accurately, the prediction results are also uncertain
There is no mature theoretical guidance for the selection of the number of hidden layers and the number of nodes of the artificial neural network. Generally, the selection is based on experience, which leads to unsatisfactory prediction accuracy and calculation efficiency of the model.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reinforcement learning unit matching recurrent neural network system and training and prediction method thereof
  • Reinforcement learning unit matching recurrent neural network system and training and prediction method thereof
  • Reinforcement learning unit matching recurrent neural network system and training and prediction method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] A reinforcement learning unit matching cycle neural network system, including a cycle neural network and a reinforcement learning unit, the cycle neural network includes input, output and a plurality of hidden layers, the hidden layer includes a plurality of hidden layer nodes, and also includes a monotonic trend Recognizer, the monotone trend recognizer judges the trend state of the time series constructed by the input, and the reinforcement learning unit is based on the trend state of the input time series, and selects a cycle in which the number of hidden layers and the number of nodes in the hidden layer are adapted to its changing law A neural network, wherein the trend state of the input time series includes multiple states, and each trend state corresponds to a cyclic neural network in which the number of hidden layers and the number of nodes in the hidden layer are adapted to its changing law.

[0038] In this embodiment, the trend state includes an upward trend ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a reinforcement learning unit matching recurrent neural network system and a training and prediction method thereof. The system includes recurrent neural networks, a reinforcement learning unit, and a monotone trend recognizer. The recurrent neural network includes input and output, and a plurality of hidden layers. The hidden layers include a plurality of hidden layer nodes. The monotonous trend recognizer judges the trend state of a time sequence constructed by input, and the reinforcement learning unit selects a recurrent neural network with the number of hidden layers and the number of hidden layer nodes matched with the change rule of the hidden layer nodes according to the trend state of the input time sequence. According to the invention, the trend state ofthe input time sequence is judged. The trend state, different hidden layer numbers and hidden layer node numbers are used for representing the state and action of the Q value set respectively, the optimal action is selected to be executed according to the updated Q value set, the generalization capability and convergence speed of the RLUMRNN are enhanced, and the residual life prediction method applied to the rotary machine is high in precision.

Description

technical field [0001] The invention relates to the technical field of neural networks, in particular to a reinforcement learning unit matching cycle neural network system and its training and prediction methods. Background technique [0002] Rotating machinery is one of the most widely used components in civil and national defense mechanical equipment. During long-term operation, rotating machinery will gradually degrade, and the remaining life will gradually decrease. The occurrence of failures often brings catastrophic accidents, resulting in major economic loss and serious social impact. For rotating machinery, the quality of the equipment can be improved through optimization of the design and manufacturing process, but it is still difficult to guarantee that there will be no failures during service. Even under the same working conditions, the rotating machinery of the same type and the same type has a large dispersion of life data due to different operating environment...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06N3/04G06N99/00G06F17/50
Inventor 李锋陈勇田大庆
Owner SICHUAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products