Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A multi-classifier training method and classification method based on non-deterministic active learning

A multi-classifier, active learning technology, applied in the fields of instruments, character and pattern recognition, computer parts, etc., can solve the problems of inability to achieve classification performance, inability to accurately describe the true distribution of sample data, etc., to achieve comprehensive and effective measurement, classification The effect of optimizing the effect and avoiding information redundancy

Active Publication Date: 2017-10-24
INST OF INFORMATION ENG CHINESE ACAD OF SCI
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional multi-classification method based on deterministic active learning only focuses on model tuning and ignores model changes, so it is only suitable for application scenarios where the number of categories is known; while the number of categories is uncertain, the multi-classification method based on deterministic active learning The method is limited to the evaluation of the amount of sample information under the existing N classification model, but it cannot accurately describe and fit the real distribution of the sample data, so that it cannot effectively improve the classification performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A multi-classifier training method and classification method based on non-deterministic active learning
  • A multi-classifier training method and classification method based on non-deterministic active learning
  • A multi-classifier training method and classification method based on non-deterministic active learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0079] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0080] Example based non-deterministic active learning method for multi-classification

[0081] The multi-classification method based on non-deterministic active learning provided by the invention realizes the gradual optimization of the classification model through a cyclic iterative process.

[0082] Assuming that each round of loop iteration needs to label K samples, the following process is executed inside each round of loop iteration:

[0083]

[0084]

[0085] After the method is executed, if the number of loop iterations is M, the total number of samples marked by experts through human-computer interaction is K×M.

[0086] Taking image classification as an example, the image sample is represented by a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-classifier training method and classification method based on non-deterministic active learning. The method is: 1) select or initialize a multi-classifier; for each sample in the unmarked sample set, use the multi-classifier to calculate the overall information amount Info of the sample; the overall information amount is: model change information amount and The sum of model tuning information; 2) cluster the unlabeled sample set to obtain J subclasses; 3) select some unlabeled samples with the smallest overall information Info value from each subclass; Select K samples from among them for labeling and add them to the labeled sample set L; 4) Use the updated labeled set L as training data to retrain the multi-classifier; 5) Iteratively execute steps 1) to 4) to set the number of times; The resulting multi-classifier is then used to classify the unlabeled set. The invention realizes the comprehensive evaluation of the amount of sample information, thereby obtaining a highly efficient and intelligent multi-classifier.

Description

technical field [0001] The invention relates to a multi-classifier training method and classification method based on non-deterministic active learning, belonging to the technical field of software engineering. Background technique [0002] Data classification has always been a research hotspot, such as patent ZL 201010166225.6 "an adaptive cascade classifier training method based on online learning", patent ZL 200910076428.3 "a cross-domain text sentiment classifier training method and classification method" , Patent ZL 200810094208.9 "Document Classifier Generation Method and System". [0003] In the classification problem of massive data, "active learning" (reference: McCallum and K.Nigam, "Employing EM in pool-based active learning for text classification," inProc. of the 15th International Conference on Machine Learning, 1998, pp .350–358.) is a machine learning method that efficiently utilizes expert labeling. Its main idea is: the machine actively and targetedly sele...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/2413G06F18/214
Inventor 张晓宇王树鹏吴广君
Owner INST OF INFORMATION ENG CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products