XGBoost soft measurement modeling method based on parallel LSTM auto-encoder dynamic feature extraction

A technology of dynamic features and modeling methods, applied in neural learning methods, instruments, character and pattern recognition, etc., can solve the problems of difficult process dynamics, process nonlinearity, high data dimensions, etc., to speed up training and improve robustness. Sticky, easy-to-migrate effects

Active Publication Date: 2019-09-06
ZHEJIANG UNIV
View PDF8 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the common data-driven soft sensor models mainly include: partial least squares, support vector machine, neural network, decision tree model, etc. These basic models can only meet the fitting and prediction of simple data, but in the actual industry In the process, there will be problems such as process nonlinearity, high-dimensional data, data noise, data non-Gaussian distribution, lack of quality variables, state drift and time-varying, process dynamics, and quality variable sampling delays. Improvements or combinations to accommodate more complex modeling problems
[0004] In the above practical problems, the process dynamics are often difficult to solve. The main reason for the process dynamics is that the samples at the current moment may have a relatively implicit relationship with the samples at several previous moments, and in terms of specific sample characteristics It is still difficult to reflect, and the more common solution at present is to use a time series model with a relatively simple structure such as ARIMA, and add the sample information of the previous moment to the model in a linear or non-linear form to improve the prediction accuracy, but this method is often There are two problems. One is that the dynamic characteristics of samples in different working conditions are not the same. For example, in some cases, the samples show linear fluctuation information, while in some cases it is non-linear. Therefore, it is difficult to use a more general time The sequence model is used to synthesize the dynamic characteristics between samples; the second is that with the gradual increase of industrial process data and the formation of industrial big data problems, the relatively simple time series model is difficult to effectively extract sufficient and accurate information from massive data, and the calculation It will also cost a lot of time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • XGBoost soft measurement modeling method based on parallel LSTM auto-encoder dynamic feature extraction
  • XGBoost soft measurement modeling method based on parallel LSTM auto-encoder dynamic feature extraction
  • XGBoost soft measurement modeling method based on parallel LSTM auto-encoder dynamic feature extraction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0082] The beneficial effects brought by the present invention will be described below in conjunction with a specific example of a debutanizer. The debutanizer process is a commonly used standard industrial process platform for the verification of soft-sensing modeling algorithms, and it is the process of removing propane and butane from naphtha gas in industrial refinery processes. The bottom butane content requires soft-sensing techniques for estimation. Image 6 The structure of the debutanizer process is described in . Table 1 shows that the seven auxiliary variables selected for the key quality variable butane content are column top temperature, column top pressure, reflux flow rate, next stage flow rate, temperature of sensitive plate, column bottom temperature and column bottom pressure. For this process, 2345 data were collected continuously at equal time intervals, of which 1563 data were used as training sample sets for modeling, and the other 782 data samples were ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an XGBoost soft measurement modeling method based on parallel LSTM auto-encoder dynamic feature extraction, and belongs to the field of industrial process prediction and control. The LSTM auto-encoder extracts an encoding vector as a dynamic feature in a manner of restoring an input sequence; the idea of data parallelism and model parallelism is utilized to carry out distributed training on the network; the modeling efficiency is improved, the extracted dynamic features and the original features are combined to train an XGBoost model, and finally the steps of predictingthe test sample, repeatedly extracting and splicing the features, inputting the features into the XGBoost model and the like are carried out until prediction of all the samples is completed. The XGBoost soft measurement modeling method based on parallel LSTM auto-encoder dynamic feature extraction is helpful for solving dynamic problems in process soft measurement based on other models, and meanwhile, two modes of data parallelism and model parallelism are adopted in the network training process, so that the network training speed is increased; and it can be ensured that the precision of theXGBoost model is unchanged or stably improved after the dynamic characteristics are introduced, and the robustness of the model is improved.

Description

technical field [0001] The invention belongs to the field of industrial process prediction and control, and relates to an XGBoost soft sensor modeling method based on parallel LSTM self-encoder dynamic feature extraction. Background technique [0002] The soft measurement of key variables in industrial processes refers to a method of predicting the measured variables by establishing a mathematical model of the measured variables and other variables. It is mainly used to estimate variables that are difficult or costly to measure in actual industrial processes. At present, soft sensor models are mainly divided into mechanism models and data-driven models. Because the derivation process of mechanism models is often very complicated, data-driven soft sensor models are widely used. [0003] At present, the common data-driven soft sensor models mainly include: partial least squares, support vector machine, neural network, decision tree model, etc. These basic models can only meet...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/049G06N3/08G06V10/462G06N3/045G06F18/214
Inventor 葛志强张鑫宇
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products