Representative image selection method based on multi-example active learning

An active learning, representational technique used in instrumentation, character and pattern recognition, computer components, etc.

Active Publication Date: 2019-07-05
ZHEJIANG UNIV OF TECH
View PDF10 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The representative image selection method in the currently published patents mainly focuses on the optimization of the clustering method, and some methods still require manual participation in labeling

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Representative image selection method based on multi-example active learning
  • Representative image selection method based on multi-example active learning
  • Representative image selection method based on multi-example active learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be described in detail below in conjunction with the embodiments and accompanying drawings, but the present invention is not limited thereto. The active learning framework refers to first using the initial sample to train the initial classifier, and using the initial classifier to predict all unlabeled samples in the original sample set; The samples with a large contribution to the classification accuracy of the classifier are handed over to the labeling experts for labeling; then the labeled samples are used together with the initial samples to train the classifier. Iterate in this way until the accuracy of the classifier does not change. At this time, although the classifier only uses a small number of labeled samples, it can achieve better classification accuracy.

[0045] The COCO data set is a target detection and classification data set provided by Microsoft. The images in the data set are mainly intercepted from complex daily scenes. Th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of machine learning, and specifically relates to a representative image selection method based on multi-example active learning, which comprises the following stepsof: (1) extracting original features of an image; (2) dimension reduction of original features; (3) carrying out original sample image pre-clustering by utilizing the dimensionality reduction characteristics; (4) selecting an initial training sample; (5) training a classifier; (6) adjusting the difficult-to-classify sample set; (7) adjusting the original sample set; (8) repeatedly executing the steps (5) to (7) to carry out iterative training until convergence; and (9) outputting a representative image. Screening out a sample set which contributes to the maximum classification precision of theclassifier from the original samples through pre-clustering, multi-example learning and active learning methods; and the samples are labeled for other machine learning tasks, so that the manpower consumed by labeling can be reduced, a part of noise samples can be filtered out, and the effective operation of other machine learning tasks is ensured.

Description

technical field [0001] The invention relates to the field of machine learning, in particular to a representative image selection method based on multi-instance active learning. Background technique [0002] With the rapid development of Internet technology, with the help of various Internet tools, people can quickly obtain a large amount of data from the Internet. However, the data obtained from the Internet is usually accompanied by more noise and a large amount of content redundancy. In the training process of the machine learning algorithm, if the original data is not cleaned and directly marked, it will not only cause a lot of waste of human resources, but also the algorithm is difficult to achieve the desired training effect due to the influence of noise. Taking advantage of the powerful analytical capabilities of weakly supervised learning, the acquired original data set can be screened first. [0003] According to the degree of training data labeling, machine learnin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2135G06F18/24G06F18/214
Inventor 朱威王义锋陈悦峰滕游陈朋郑雅羽
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products