Noise identification method for image data

A recognition method and image data technology, applied in the field of image recognition, can solve the problems of large loss value, low accuracy rate of noise samples, large difference between clean samples and category centers, etc., to achieve the effect of improving the accuracy rate

Pending Publication Date: 2022-05-27
SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the existence of difficult samples in clean samples, some clean samples will also have a large loss value and thus be judged as noise samples. The accuracy of detecting noise samples with a simple loss value threshold method is low
In other words, some clean samples are too different from the category center, which leads to being mistaken for noise samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Noise identification method for image data
  • Noise identification method for image data
  • Noise identification method for image data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Embodiments of the present invention will be described in detail below. It should be emphasized that the following description is exemplary only, and is not intended to limit the scope of the invention and its application.

[0037] see figure 1 , an embodiment of the present invention provides a noise identification method for image data, including the following steps:

[0038] S1. In the noise learning based on neural network, a Gaussian mixture distribution model is used, and the combination of the loss value of the sample to be detected and the maximum non-target probability is used as a training dynamic to jointly model; wherein, the maximum non-target probability refers to, Among the classification and identification probabilities output by the sample after passing through the neural network, the probability is the largest except the target class probability. This largest non-target probability records the characteristics of the degree of confusion between the sam...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A noise identification method for image data comprises the following steps: S1, in noise learning based on a neural network, using a Gaussian mixture distribution model, and taking a loss value of a to-be-detected sample and a maximum non-target probability as training dynamics for joint modeling; wherein the maximum non-target probability refers to the maximum probability, except the target class probability, in each classification identification probability output after the sample passes through the neural network; s2, calculating a loss value and a maximum non-target probability of a to-be-detected sample by using the trained neural network model, fitting a Gaussian mixture distribution model by using the calculated loss value and the maximum non-target probability, and outputting a probability that the to-be-detected sample belongs to a clean sample and a noise sample; and dividing the to-be-detected sample into a clean sample or a noise sample according to the probability. According to the method, the distance information between the sample and the center of the class and the confusion information between the sample and the centers of other classes are considered at the same time, the difficult sample and the noise sample are effectively separated, and the noise detection accuracy is improved.

Description

technical field [0001] The present invention relates to image recognition, in particular to a noise recognition method in image classification tasks. Background technique [0002] Abbreviations and terms: [0003] Learning with Noise: This task refers to how to learn a high-performance model on a dataset containing noisy labels. The training of deep neural networks usually requires a large number of clean samples, but in practical application scenarios, obtaining large-scale, high-quality clean labels faces problems such as high labeling cost, high time overhead, and low labeling quality. For example, in the crowdsourcing scenario, the company usually invites several annotators to annotate a large number of unlabeled samples. However, due to the uneven annotation capabilities of different annotators and mislabeling, etc., a large amount of annotation costs and time costs are incurred. Get a dataset with noisy labels. In addition, it is a low-cost way to obtain samples by ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/764G06V10/774G06K9/62G06N3/04
CPCG06N3/04G06F18/2155G06F18/2413
Inventor 袁春王子啸
Owner SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products