A method for predicting a sewage outlet index based on a random forest and a gradient lifting tree
A gradient boosting tree and random forest technology, applied in prediction, neural learning methods, computer components, etc., can solve the problems of model accuracy to be improved, slow training speed, etc., to improve the training rate and data quality, and reduce processing costs , the effect of improving the prediction accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach
[0069] As an implementation manner, the calculation of feature importance specifically includes the following steps:
[0070] For each regression tree (regression problem) in the random forest, use the corresponding out-of-bag sample to calculate its out-of-bag sample error, which is recorded as err 1 ;
[0071] Randomly add noise interference to the characteristics of the out-of-bag samples, and calculate its out-of-bag error again, which is recorded as err 2 ;
[0072] Among them, out-of-bag samples refer to sample data that are not used as training samples;
[0073] The formula for calculating the importance of a feature is:
[0074]
[0075] Among them, n is the sample number of out-of-bag samples, f is the sum of the out-of-bag sample error and the out-of-bag error added with noise interference, according to f as the value of the importance of a certain feature;
[0076] According to the calculated feature importance, the features are sorted and the important featu...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


