Sample data cleaning method, device, computer device, and storage medium

A sample data cleaning technique based on sample data and a preset distance, applied in the field of data processing, that addresses the problem of low accuracy of training sample data.

Active Publication Date: 2019-01-18
PING AN TECH (SHENZHEN) CO LTD
8 Cites, 32 Cited by

AI Technical Summary

Problems solved by technology

[0003] Embodiments of the present invention provide a sample data cleaning method, device, computer equipment, and storage medium to solve the problem of low accuracy of training sample data.


Detailed Description of the Embodiments

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0028] The sample data cleaning method provided by the embodiment of the present invention can be applied in an application environment such as that shown in Figure 1, in which the client (computer device) communicates with the server through the network. The client collects or obtains the initial image set and sends it to the server, and the server processes the initial image set to finally obtain the target training set. Among them, the client ...

Abstract

The invention discloses a sample data cleaning method, device, computer device, and storage medium. First, an initial image set is obtained and input to a feature classification model for recognition, yielding a feature recognition result. An initial training set is then obtained according to the feature recognition result; it comprises initial training images and the labeling data corresponding to each initial training image. Because the initial training set is selected from the initial image set by the feature classification model according to preset requirements, the data richness of the training data is ensured. On this basis, the initial training images are classified according to the labeling data to obtain a classification training set, and the classification training set is cleaned to obtain the target training set. Cleaning the training data while preserving its richness ensures the accuracy of the training data, which in turn improves the accuracy of subsequent model training.
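The pipeline in the abstract (select, group by label, clean) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented procedure: the text available here does not say how the cleaning step works, so the sketch assumes that a sample is dropped when its distance to the centroid of its class exceeds a preset distance. Images are stood in for by plain feature vectors, the model's predicted label is used as the labeling data, and all function names (`select_initial_training_set`, `group_by_label`, `clean_group`, `build_target_training_set`, `toy_classifier`) are hypothetical.

```python
from collections import defaultdict
from math import dist
from statistics import fmean


def select_initial_training_set(initial_images, classify, wanted_labels):
    """Keep (features, label) pairs whose predicted label meets the preset requirement."""
    training = []
    for features in initial_images:
        label = classify(features)  # stand-in for the feature classification model
        if label in wanted_labels:
            training.append((features, label))
    return training


def group_by_label(training_set):
    """Classify the initial training samples by their labeling data."""
    groups = defaultdict(list)
    for features, label in training_set:
        groups[label].append(features)
    return dict(groups)


def clean_group(vectors, preset_distance):
    """ASSUMED cleaning rule: drop samples farther than preset_distance from the class centroid."""
    centroid = [fmean(column) for column in zip(*vectors)]
    return [v for v in vectors if dist(v, centroid) <= preset_distance]


def build_target_training_set(initial_images, classify, wanted_labels, preset_distance):
    """Run selection, grouping, and cleaning; return {label: cleaned samples}."""
    initial = select_initial_training_set(initial_images, classify, wanted_labels)
    groups = group_by_label(initial)
    return {label: clean_group(vs, preset_distance) for label, vs in groups.items()}


def toy_classifier(features):
    """Hypothetical classifier: labels a vector by the sign of its first coordinate."""
    return "face" if features[0] > 0 else "other"


images = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (9.0, 9.0), (-1.0, 0.0)]
target = build_target_training_set(images, toy_classifier, {"face"}, preset_distance=4.0)
print(target)
```

In this toy run the vector `(9.0, 9.0)` is labeled "face" but lies far from the other "face" samples, so the assumed distance rule removes it. A real system would presumably compare learned image features rather than raw two-dimensional points.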

Description

Technical field

[0001] The present invention relates to the field of data processing, and in particular to a sample data cleaning method, device, computer equipment, and storage medium.

Background

[0002] With the development of computer technology, deep learning has been widely used in various fields. Training a deep learning model requires a large number of training samples; with fewer training samples, the results are much worse. Traditionally, existing training sample data is obtained from the network or a third-party data platform for model training. However, for many specific application scenarios, the training sample data available from these sources may not meet actual needs, so much of the sample data has to be collected manually, which makes obtaining training sample data very inconvenient. Moreover, due to the relatively large amount of data in the training samples, there are often e...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/62, G06N3/04
CPC: G06V40/50, G06N3/045, G06F18/22, G06F18/214, Y02D10/00
Inventor: 徐玲玲
Owner: PING AN TECH (SHENZHEN) CO LTD