Construction method for classifier

A construction method and classifier technology, applied in the field of data processing, can solve the problems of low classification accuracy of recognition or classification technology, achieve the effect of improving data imbalance, simple method, and improving accuracy

Inactive Publication Date: 2015-07-08
HARBIN INST OF TECH
View PDF0 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a method for constructing a classifier, the purpose of which is to solve the problem of low accuracy of the existing identification or classification technology for data such as flow data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Construction method for classifier
  • Construction method for classifier
  • Construction method for classifier

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the purpose of the invention, technical solutions and beneficial effects of the present invention clearer, the embodiments of the present invention will be described below in conjunction with the accompanying drawings. The features in can be combined arbitrarily with each other.

[0025] An embodiment of the present invention provides a method for constructing a classifier, the construction method comprising: using the undersampling method to remove part of the majority class training samples in the training sample set, and updating the current training sample with the undersampled training sample set set, wherein the training sample set includes majority class training samples and minority class training samples, and each training sample in the training sample set has a known category; and the minority class training samples in the training sample set are oversampled to utilize the The processed training sample set is used to construct a classifier.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a construction method for a classifier. The construction method includes the following steps that a part of majority class training samples in a training sample set are removed through an undersampling method, and a current training sample set is updated through the undersampled training sample set, wherein the training sample set comprises the majority class training samples and minority class training samples, and the classes of all the training samples in the training sample set are known; oversampling is conducted on the minority class training samples in the training sample set, and the classifier is constructed through the oversampled training sample set. According to the construction method for the classifier, noise in the training samples is removed effectively, the problem of data imbalance can be solved effectively, the accuracy rate of training sample data classification is greatly increased, the calculation amount is small, and the method is simple.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a method for constructing a classifier. Background technique [0002] In the traffic identification method based on machine learning, noise data will appear in the traffic data set, which has a great impact on the identification of small sample data in unbalanced data. Therefore, data cleaning and denoising of network traffic has a great impact on the accuracy of classification has important meaning. For the machine learning method of network traffic classification, there may be a small part of noise traffic and a small part of useful traffic that need to be obtained in traffic identification. [0003] At present, the existing identification or classification technologies for data such as traffic data have low classification accuracy, and the calculations are large, the method is complicated, and it is too time-consuming. Contents of the invention [0004] The present invention...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06K9/66
Inventor 余翔湛叶麟张伟哲何慧张宏莉丛小亮王岳
Owner HARBIN INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products