Unlock instant, AI-driven research and patent intelligence for your innovation.

A random forest-based personnel risk analysis method

A random forest and analysis method technology, applied in the field of personnel risk analysis, can solve the problems of low model accuracy, inability to treat differently, low evaluation reliability and validity, etc., to achieve easy understanding and grasp, easy to understand and grasp , the effect of reducing the cost of labeling

Active Publication Date: 2021-11-16
广西友迪资讯科技有限公司
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, there are mainly three types of personnel risk assessment: one is empirical assessment. Due to the limitation of the conditions at the time, the risk assessment method is mainly defined by the evaluator’s intuitive feeling of the transformation performance. This type of method relies on the evaluator’s subjective judgment. The accuracy rate is not high, and the key to the evaluation process is to rely on the quality of the evaluator. For the same person to be evaluated, the conclusions of different evaluators may be quite different; The accuracy rate no longer depends on the subjective judgment of the evaluator, but the preparation of mature and effective scales takes a very long time, and most scales cannot be treated differently with changes in social environment, geographical region, cultural customs, and causes of formation; the third The first is a risk assessment method driven by machine learning data, relying on multi-dimensional data accumulated over the years, using machine learning algorithms to train classification models, and then classifying the evaluators through the trained model
[0003] The risk assessment method based on machine learning is still in the exploratory stage in general, but in the actual operation process, there are generally problems of difficult operation and low evaluation reliability and validity
Existing methods are all based on experience, mostly in the form of classification, that is, to divide the personnel to be evaluated into two categories of dangerous and non-dangerous, or into a few categories such as high-risk, medium-risk, and low-risk, lacking quantitative indicators of risk levels , it is impossible to accurately distinguish the different risk levels of different persons to be evaluated in a category; secondly, the labeling method of the training data set depends on subjective judgment or scale test results, not only the cost of labeling is high, but both subjective judgment and scale test as mentioned above There is a large error, resulting in low accuracy of the training data set, and the model trained by the error training data is bound to be inaccurate
Although some methods can obtain quantitative indicators of risk levels, it is difficult to explain the meaning of their values, and it is difficult for users to understand and grasp them.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A random forest-based personnel risk analysis method
  • A random forest-based personnel risk analysis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0049] The risk analysis method of personnel based on random forest, including the selection of objective function and features, training model and evaluation process; among them,

[0050] (1) Target category selection:

[0051] Select a dimension YD as the target category in the multi-dimensional data of personnel. The YD is related to the degree of danger. YD is binary. The value of YD is 1 or 0, where 1 indicates a category with a large risk tendency and 0 indicates a risk tendency Small categories, corresponding to the two categories of yes and no;

[0052] Whether YD chooses to be listed as a key point, whether to use equipment, whether to use restrictive protection measures or management levels;

[0053] The management level needs to be preprocessed into multiple binary dimensions. First, set the management level as strict management, common management, and wide management. Each level is divided into strict management, general management, and wide management, that is O...

Embodiment 2

[0077] According to the method of Example 1, the present invention is implemented in two units (hereinafter referred to as unit A and unit B), using the personnel data of unit A since 2016-09-01 after data cleaning, eliminating errors and poor quality data , construct the initial data set PreTrainSet, including 4113 samples; use "whether it is listed as a key point" as YD, apply the method of the present invention to train the personnel risk assessment model, and then use the model obtained from the training to the 2312 personnel who have entered complete data in unit B Personnel conduct risk assessment, and calculate the risk of the assessed personnel as the multiple SPYD of the average level. After sorting the obtained SPYD, 20 persons with the maximum value and minimum value of SPYD were selected, and the experts subjectively and manually evaluated their risks. The results are as follows:

[0078] Experts determine the number of people at risk Number of people...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a random forest-based personnel risk analysis method, which relates to the field of machine learning and comprises a target category selection process, a training process and an evaluation process. Comprising the following steps: selecting YD related to a danger degree from personnel multi-dimensional data as a target category; setting a danger degree index PYD, and establishing a training data set PreTrain Set set; training a random forest regression model to obtain RFM, GPYD and LNPY; calculating to obtain that the risk of the evaluated personnel is a multiple SPYD of the average level, so that the relative dangerousness can be visually evaluated. According to the evaluation method, the training data do not contain subjective judgment or scale test results of people, the model quality is higher, meanwhile, the data labeling cost is reduced, the model updating speed is increased, and the model can adapt to environment changes more quickly and at lower cost.

Description

technical field [0001] The invention relates to the field of machine learning, in particular to a random forest-based personnel risk analysis method. Background technique [0002] At present, there are mainly three types of personnel risk assessment: one is empirical assessment. Due to the limitation of the conditions at the time, the risk assessment method is mainly defined by the evaluator’s intuitive feeling of the transformation performance. This type of method relies on the evaluator’s subjective judgment. The accuracy rate is not high, and the key to the evaluation process is to rely on the quality of the evaluator. For the same person to be evaluated, the conclusions of different evaluators may be quite different; The accuracy rate no longer depends on the subjective judgment of the evaluator, but the preparation of mature and effective scales takes a very long time, and most scales cannot be treated differently with changes in social environment, geographical region,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/06G06Q50/26G06K9/62G06N20/00
CPCG06Q10/06393G06Q10/0635G06Q50/26G06N20/00G06F18/24323
Inventor 许金礼廖淑珍陆宇升陶炜朱晓东吕思霖
Owner 广西友迪资讯科技有限公司