Weak label data denoising method based on regularized label propagation

A label propagation and data noise reduction technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems of low accuracy and low efficiency of noise reduction, improve the efficiency of reduction, optimize the quality of labeling, and improve noise reduction The effect of accuracy

Active Publication Date: 2019-03-29
NAT UNIV OF DEFENSE TECH
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention is to provide a weakly labeled data noise reduction method based on regularized label propagation for the existing noise reduction methods that require manual experience to intervene, resulting in low noise reduction accuracy and efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Weak label data denoising method based on regularized label propagation
  • Weak label data denoising method based on regularized label propagation
  • Weak label data denoising method based on regularized label propagation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In the following, an example is used to describe the denoising method using the weakly labeled data denoising method based on regularized label propagation. as attached figure 1 as shown,

[0042] Step 1: Obtain the required weakly labeled sample dataset from the crowdsourcing data platform;

[0043] For large-scale data labeling, the crowdsourcing platform represented by Amazon (Amazon MechanicalTurk) is often used to distribute data on the Internet, labeling by network employees, and then combining multiple labeling results to give the final labeling of large-scale data sets . Although this labeling method makes good use of group wisdom and idle Internet resources, different employees have different professional fields and lack of expert experience, resulting in a certain proportion of data being incorrectly labeled. Large-scale weakly labeled data samples can be obtained through this crowdsourcing method that does not rely on domain expert experience. Although we...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a noise reduction method of weak labeling data based on regularized label propagation, which comprises the following steps: 1, obtaining a required weak labeling sample data set from a crowdsourcing data platform; 2, constructing a local neighborhood structure for that weakly labeled image sample data; 3, solving the sample nearest neighbor weighted similarity matrix of thelocal neighborhood structure; 4. Constructing a noise reduction model based on system state consistency for local neighborhood structure under weak supervision; 5, solving that noise reduction modelbased on the regularize label propagation method, and achieving the noise reduction of the weakly labeled sample data set. By constructing a local neighborhood structure of weak labeling data, Considering the whole weak labeling sample data set as a system, a denoising method of weak labeling data is proposed from the viewpoint of system consistency, which realizes the denoising of weak labeling data, optimizes the labeling quality of sample data set, and does not require manual experience or expert intervention in the denoising process, effectively improves the denoising accuracy and reducesthe efficiency.

Description

technical field [0001] The invention belongs to the field of data denoising, in particular to a method for denoising weakly labeled data based on regularized label propagation. Background technique [0002] In the context of big data, sample noise is ubiquitous in real life, industrial production, and engineering applications. These noises come from a wide range of sources, including system measurement errors of sensors and errors in data processing. The impact of noisy data removal on learning problems is manifold. Directly removing noisy data is a feasible strategy when the available data scale is large. However, some samples with beneficial information may be ignored, and the assumption of independent and identical distribution will no longer hold after the data is removed. In the case of a small data sample size, the removal of noisy data may directly lead to the inability to learn the model, resulting in a waste of data resources. [0003] In the field of machine lear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/40G06K9/62
CPCG06V10/30G06F18/2135G06F18/214
Inventor 黄金才黄红蓝冯旸赫刘忠王琦程光权
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products