Unlock instant, AI-driven research and patent intelligence for your innovation.

Three-way decision active learning method taking neighborhood entropy as query strategy

An active learning and neighborhood technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., to achieve the effect of improving performance

Pending Publication Date: 2021-12-24
GUILIN UNIVERSITY OF TECHNOLOGY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At the same time, the neighborhood entropy is used as the decision function to map the unlabeled data set to three regions, and then the data in different regions are processed separately to solve the problem that the data in different regions have different values.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Three-way decision active learning method taking neighborhood entropy as query strategy
  • Three-way decision active learning method taking neighborhood entropy as query strategy
  • Three-way decision active learning method taking neighborhood entropy as query strategy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In this example, the data sets published on UCI (http: / / mlr.cs.umass.edu / ml / dataets.html) and KEEL (https: / / sci2s.ugr.es / keel / datasets.php) are selected for experimentation , to verify the validity of the method. In order to test the performance of the classifier, this embodiment selects 50 most valuable unlabeled data sets to mark, and uses the logistic regression classifier to classify the test set, and uses ACC and F1_Value as evaluation indicators to test its classification performance. In order to ensure the authenticity and reliability of the experimental results, the experiment was repeated 10 times, and the average value was taken as the final result. In order to illustrate the technical solutions of the present invention, specific examples are used below to illustrate. In this embodiment, the Australian data set on UCI is used for classification. This data set has 690 14-dimensional data, and the test set is The training set is The unlabeled dataset is U={...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a three-way decision active learning method taking neighborhood entropy as a query strategy. The method comprises the following steps: training a classifier by using a marked data set; classifying the test set by using the trained classifier and recording a classification result of the test set; calculating neighborhood entropies of all the unmarked data, and dividing the unmarked data into a positive domain, a boundary domain and a negative domain according to the size of the neighborhood entropies; respectively processing the data of different areas; selecting a part of most valuable unmarked data, and marking the unmarked data by a human expert or an annotation device; after marking, adding a marked data set and using the marked data set for the next training of the classifier; and executing the above processes in a loop iteration manner until a preset condition or an expected evaluation standard is reached, and stopping learning. According to the method, a small amount of most valuable data can be selected and marked, selection of redundant data and data which contribute little to classification performance is avoided, and meanwhile, the cost for marking a large amount of unmarked data can be reduced.

Description

technical field [0001] The invention belongs to the technical field of data mining and information processing, and in particular relates to a three-way decision-making active learning method using neighborhood entropy as a query strategy. Background technique [0002] In real life, unlabeled data is abundant and easy to obtain. These unlabeled data contain a lot of effective information. However, it takes a lot of manpower and material resources to manually extract useful information from a large amount of unlabeled data. Therefore, how never Mining the most effective information from labeled data has become a major research hotspot. Active learning (Active Learning) is one of the widely used machine learning methods, which aims to reduce the amount of labeled data required, that is, it only needs to use less training data to train a classifier with better performance. Active learning uses an appropriate query strategy to iteratively select the most valuable unlabeled data ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/217G06F18/214G06F18/24
Inventor 董明刚吕秋月
Owner GUILIN UNIVERSITY OF TECHNOLOGY