Multi-model fusion method and device, electronic equipment and computer readable storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A fusion method and model fusion technology, which is applied in the field of machine learning and can solve problems such as being unrobust, not suitable for continuous training of streaming data, and not suitable for semi-supervised learning methods in its basic form.

Pending Publication Date: 2022-08-05

深延科技(北京)有限公司

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Tree-based ensemble models can indeed achieve competitively accurate predictions, fast training, and easy interpretability, but compared with deep learning models, tree-based models still have the following limitations: (a) are not suitable for continuous training, and does not allow efficient end-to-end learning of image / text encoders in the presence of multimodal as well as tabular data; (b) in its basic form is not suitable for state-of-the-art semi-supervised learning methods; and (c), It is not suitable for the latest deep learning methods for dealing with missing data and data noise

Even so, TabTransformer still does not have good robustness on imbalanced datasets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

example 1

[0101]

[0102] image 3 It is the model structure diagram of the fusion model obtained by the mean fusion method. like image 3 As shown, in the mean fusion method, the training results of the two sub-models trained in the model pre-training are used for model fusion. Specifically, use the trained TabTransformer and LGB on the test dataset (x Test ,y Test ) to make predictions and get the prediction results p respectively Tab_Test and p LGB_Test , and average the prediction results of the two, that is, p=(p Tab_Test +p LGB_Test ) / 2, as the prediction result of the fusion model.

example 2

[0103]

[0104] Figure 4 It is the model structure diagram of the fusion model obtained by the residual fusion method. like Figure 4 As shown, in the residual fusion method, the training data set (x T ,y T ), validation dataset (x V ,y V ) and the test dataset (x Test ,y Test ) to predict respectively, and get the prediction result p Tab_T , p Tab_V and p Tab_Test , and calculate the prediction result p respectively Tab_T , p Tab_V The residual between the predicted target y, i.e. get respectively and Then, replace the prediction targets in the training dataset and validation dataset with these residuals to get the replaced training dataset and the replaced validation dataset Train the LGB model with the replaced training dataset and validation dataset, and use the trained LGB model on the test dataset (x Test ,y Test ) to make predictions and get the prediction result p LGB_Test , and finally, with p=(p Tab_Test +p LGB_Test ) as the prediction res...

example 3

[0105]

[0106] Figure 5 It is the model structure diagram of the fusion model obtained by the neural network graft fusion method. like Figure 5 As shown, in the neural network graft fusion, the training data set (x T ,y T ) and the validation dataset (x V ,y V ) to predict and get the test result p LGB_T and p LGB_V . Then, insert the obtained test result into the network operation of TabTransformer. Specifically, because the test result belongs to numerical data, it is different from the original x cont The c consecutive features of , together form c+1 features, and are embedded with the context {h in the concatenation layer 1 ,…,h m } are concatenated together to form a vector of dimension (d×m+c+1), which is then fed into the represents the MLP layer to predict the target y. Here, the loss function is:

[0107]

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a multi-model fusion method and a related device, and the method comprises the steps: collecting table data, carrying out the data processing of the table data, and carrying out the grouping of the processed data, so as to obtain a training data set and a test data set; constructing a neural network model and a decision tree model; training at least one of the first decision-making tree model and the decision-making tree model by using the training data set, and testing the test data set by using the trained first decision-making tree model and / or the trained decision-making tree model to obtain a test result of the first decision-making tree model and / or the first decision-making tree model; performing model fusion by using a test result of the first decision tree model, or performing model fusion by using a test result of a trained sub-model and an untrained sub-model; and training the fused model to obtain a trained fused model. According to the method, the deep learning neural network and the tree model can be complementarily fused, and the table data prediction model which is more stable and higher in accuracy is obtained.

Description

technical field [0001] The present application relates to the technical field of machine learning, and in particular, to a multi-model fusion method, apparatus, device and computer-readable storage medium based on a deep neural network and a tree model. Background technique [0002] Tabular data is the most common type of data in many practical applications, such as recommender systems and online advertising. At this stage, the processing of tabular data is basically an ensemble method based on tree models, such as gradient boosted decision trees (GBDT). Tree-based models are the exact opposite of the current types of neural network-based deep learning models. Tree-based ensemble models do achieve competitively accurate predictions, fast training, and easy interpretability, but compared with deep learning models, tree-based models still have the following limitations: (a) they are not suitable for continuous analysis of streaming data training, and does not allow efficient...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N20/20G06N5/00G06N3/08G06N3/04

CPCG06N20/20G06N3/08G06N5/01G06N3/045

Inventor 陈海波罗志鹏徐振宇

Owner 深延科技(北京)有限公司

Multi-model fusion method and device, electronic equipment and computer readable storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

example 1

example 2

example 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology