Credibility-based random forest soil heavy metal risk evaluation method and system

A risk assessment and random forest technology, applied in the field of artificial intelligence, can solve problems such as being unable to deal with unbalanced data sets

Pending Publication Date: 2021-04-09
WUHAN POLYTECHNIC UNIVERSITY
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the defect that the existing technology cannot deal with unbalanced data sets, the technical soluti...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Credibility-based random forest soil heavy metal risk evaluation method and system
  • Credibility-based random forest soil heavy metal risk evaluation method and system
  • Credibility-based random forest soil heavy metal risk evaluation method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The technical scheme of the invention can adopt computer software to support the automatic operation process. The technical solution of the present invention will be described in detail below in conjunction with the drawings and embodiments.

[0051] Aiming at the defects of the prior art, the present invention proposes the weighting of a random forest-based learner based on the true positive rate and uses it for soil heavy metal pollution risk assessment. see figure 1 , the reliability-based random forest soil heavy metal risk assessment method provided by the embodiments of the present invention comprises the following steps:

[0052] (1) Data preprocessing: including preprocessing the original data set to obtain an unlabeled data set; including data labeling, including selecting a soil pollution evaluation method to mark whether the sample has a pollution risk.

[0053] Furthermore, the present invention preferably proposes to mark the data set as a binary classifi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a credibility-based random forest soil heavy metal risk evaluation method and system, and the method comprises the steps: carrying out the data preprocessing: carrying out the preprocessing of an original data set, and obtaining an unmarked data set; marking data, namely marking whether the samples in the data set have pollution risks or not by adopting a soil pollution evaluation method; dividing a data set, namely performing stratified sampling on the data set according to proportions of different categories, and dividing a training set and a test set; model training: learning the training set by using a random forest algorithm based on true positive rate weighted voting to obtain a risk evaluation model, and inputting the test set into the model to obtain a risk evaluation result, using a Bayesian optimization algorithm to find a parameter combination with the highest accuracy rate by taking the accuracy rate as an optimization target, and and carrying out soil heavy metal risk evaluation and evaluation by utilizing the trained model. According to the method, the recall rate of few types of samples is improved on the unbalanced data set, and the pollution-free risk of the samples can be accurately distinguished.

Description

technical field [0001] The invention belongs to the field of artificial intelligence, and mainly relates to a reliability-based random forest soil heavy metal risk assessment method. Background technique [0002] Among the soil inorganic pollutants, heavy metals are more prominent, mainly because heavy metals cannot be decomposed by soil microorganisms, but are easy to accumulate and transform into more toxic methyl compounds, and some even accumulate in the human body at harmful concentrations through the food chain, seriously endangering human health. Soil heavy metal pollution seriously threatens the safety of ecosystems and agricultural products. [0003] At present, there are studies that use the random forest algorithm to evaluate the impact factors of soil heavy metal content, such as the patent document with the application number CN201610997260.X. However, the classification accuracy of the random forest algorithm is still insufficient: [0004] Random Forest alg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q10/06G06N3/00G06N20/00
CPCG06Q10/0635G06N3/006G06N20/00
Inventor 张聪喻子言王恒张俊杰曹文琪胡殿涛
Owner WUHAN POLYTECHNIC UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products