Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data denoising method based on mark risk control

A risk control and data technology, applied in neural learning methods, neural architectures, biological neural network models, etc., can solve the problems of difficulty in collecting data, consuming a lot of manpower and material resources, etc., achieving strong robustness, good image noise removal, and prevention. The effect of learning performance degradation

Pending Publication Date: 2021-08-20
NANJING UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the performance of deep learning depends on a large amount of accurately labeled data. In practical applications, it is often difficult to collect a large amount of accurately labeled data, because obtaining accurate data labels requires a lot of manpower and material resources.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data denoising method based on mark risk control
  • Data denoising method based on mark risk control
  • Data denoising method based on mark risk control

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0014] Such as figure 1 As shown, the data denoising method based on labeling risk control maintains two neural networks based on the small loss criterion and selects data with small loss as low-risk data to update the peer-to-peer network, and each network finds and removes high-risk data. And retrain on the remaining data, pay attention to the inconsistency of the two networks during the training process, if the inconsistency tends to be stable or the number of learning rounds reaches the preset...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data denoising method based on mark risk control, and aims to solve the problem that the success of data deep learning often depends on a large amount of accurately marked data, but a large amount of accurately marked data is usually difficult to collect in an actual scene. In order to reduce the influence of data marking noise on the performance of the neural networks, the method maintains the two neural networks to mutually select data with small loss as low-risk data to update the peer-to-peer network, and each network filters out high-risk data in the network and retrains the remaining data. Along with training, the two networks are more and more similar to cause learning performance degradation, in order to solve the problem, when the inconsistency of the two neural networks is stable, mutual data selection is stopped, and the obtained low-risk data is utilized to update the networks until convergence. Compared with the prior art, the invention enables the deep neural network to have higher robustness.

Description

technical field [0001] The invention relates to a data denoising method based on marker risk control, which can screen out high-risk marker data and improve robustness, and belongs to the technical field of computer artificial intelligence data analysis. Background technique [0002] In recent years, deep learning has achieved great success in various fields, such as face recognition, automatic driving, machine translation and so on. However, the performance of deep learning depends on a large amount of accurately labeled data. In practical applications, it is often difficult to collect a large amount of accurately labeled data, because obtaining accurate data labels requires a lot of manpower and material resources. To solve this problem, people usually use crowdsourcing technology to allocate a large amount of unlabeled data to voluntary users for labeling. Due to the uneven level of users, the obtained labels are often noisy. How to learn from data with labeled noise has...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 王魏胡圣佑
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products