Evaluation method for performance influence degree of classification models by class imbalance

A technology of classification model and degree of influence, applied in character and pattern recognition, instruments, computer parts, etc., can solve the problems of classification imbalance, not fully considering the influence of class imbalance classification model, etc., and achieve high universality. Effect

Inactive Publication Date: 2016-01-13
CHINA UNIV OF MINING & TECH
View PDF2 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, while solving the classification imbalance problem, it often needs to be combined with a specific classification model or verified under c

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Evaluation method for performance influence degree of classification models by class imbalance
  • Evaluation method for performance influence degree of classification models by class imbalance
  • Evaluation method for performance influence degree of classification models by class imbalance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] In order to better understand the technical content of the present invention, specific examples are given and described as follows in conjunction with the accompanying drawings.

[0070] figure 1 It is an overall framework diagram of a method for evaluating the degree of impact of class imbalance on the performance of a classification model in an embodiment of the present invention.

[0071] A method for evaluating the degree of impact of class imbalance on the performance of classification models, characterized by comprising the following steps.

[0072] S1 Classification model library construction, using typical classification algorithms in machine learning to build a classification model library, initialize the classification model and set the operating parameters of each model. At the same time, the classification model library is updatable, which can realize functions such as addition, modification and deletion of classification models.

[0073] S2 new data set ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an evaluation method for performance influence degree of classification models by class imbalance. The evaluation method comprises the following steps of (1) building a classification model base; (2) constructing a new data set; (3) forecasting the new data set by the classification models; (4) evaluating the performance of the classification models; and (5) evaluating an influence degree level. According to the evaluation method, firstly, a typical classification algorithm in machine learning is adopted to build the classification model base; secondly, a class imbalance data set is selected as a reference data set, a group of new data sets with imbalance ratio gradually increased is built on the basis, different classification models are selected to respectively classify and forecast the group of new data sets; and finally, a variable coefficient is adopted to evaluate the performance variation degree of the classification models and also carry out level division, thus, the influence degree of the class imbalance on the performance of different classification models is evaluated, and a guidance significance is played in research on the class imbalance process. With regards to different classification models, the evaluation method for performance influence degree of the classification models by class imbalance, provided by the invention, has high universality.

Description

technical field [0001] The invention belongs to the fields of data mining and machine learning, and relates to an evaluation method of a classification model, in particular to an evaluation method of the influence degree of class imbalance on the performance of the classification model. Background technique [0002] Classification is an important technology in the field of data mining. It refers to the process of building a classification model through learning on the data of known categories, and then predicting the data of other unknown categories. In the process of building a classification model, it is often necessary to combine algorithms or models in machine learning to improve the accuracy of classification. [0003] With the continuous development of data mining and machine learning, the problem of class imbalance has gradually become a research hotspot in these fields. In general, class imbalance refers to the imbalance in the distribution of sample sizes among dif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/217
Inventor 于巧姜淑娟张艳梅王兴亚
Owner CHINA UNIV OF MINING & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products