A Method for Learning Labeled Noisy Images Based on Dual Active Query

An active, image technology, applied in the field of image learning, can solve the problem of noise label noise transfer matrix estimation bias, and achieve the effect of avoiding estimation bias

Active Publication Date: 2021-10-01
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to propose a label-noisy image learning method based on dual active query to solve the problem that the existing deep neural network is easy to overfit the noise label and the noise transfer matrix estimation deviation caused by the traditional active query preference, Maximize the learning accuracy of the classifier while saving the labeling cost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Learning Labeled Noisy Images Based on Dual Active Query
  • A Method for Learning Labeled Noisy Images Based on Dual Active Query
  • A Method for Learning Labeled Noisy Images Based on Dual Active Query

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] This embodiment describes a method for learning tagged images with noise based on dual active query.

[0026] Such as figure 1 As shown, the method includes the following steps:

[0027] Step 1. Get image set X and noise label set Y , forming a noisy data set D =(X, Y ).

[0028] The labels of the image set X can be obtained by manual labeling, such as asking experts or crowdsourcing, etc.; they can also be obtained by automatic collection, such as crawlers. However, the marks actually collected by these methods Y contains a large number of wrong labels.

[0029] If a deep neural network classifier model is directly learned on these real data sets, it will overfit the noisy data, resulting in a decrease in its own generalization performance. Existing learning theories suggest that:

[0030] The noise is modeled by the noise transfer matrix. When the noise transfer matrix is ​​estimated accurately, the optimal classifier on the noise data is equivalent to the opt...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a label noise image learning method based on double active query, which comprises the steps of: building a deep convolutional neural network classifier and a noise transfer matrix, and pre-training the classifier and the noise transfer matrix on a noisy data set; Select the image set and obtain the real label of the selected image set; use the selected image set and real label to update the noise transfer matrix; initialize the classifier based on the noise transfer matrix; construct the loss function on the real label and the loss function on the noise label; based on Stochastic gradient descent minimizes the true label loss and noise label loss, and updates the classifier parameters; repeats the iterative process to the maximum number of iterations K, and completes the training of the deep convolutional neural network classifier g. The invention uses the noise transfer matrix to establish a noise model, and introduces real labels to estimate the noise transfer matrix and improve the classifier; by designing a dual active query method, it saves labeling costs and maximizes the learning accuracy of the classifier.

Description

technical field [0001] The invention belongs to the technical field of image learning, and relates to a label-noisy image learning method based on double active query. Background technique [0002] The deep convolutional neural network image classifier model requires a large number of labels, but the actual collected labels often contain a lot of noise, and the deep neural network is easy to overfit the noisy data, which limits the generalization performance of the classifier model . [0003] Existing learning theory shows that by building a model for noise, if the noise model estimate is accurate, then the optimal classifier on noisy data is equivalent to the optimal classifier on real label data. [0004] The noise transfer matrix is ​​a commonly used noise model, which contains the flip probability between each category of the image, so as to realize the mapping between the real label probability distribution and the noise label probability distribution. Whereas estimat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/047G06N3/045G06F18/2415G06F18/241
Inventor 李绍园侍野黄圣君
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products