Method, system and device for improving quality of classified learning data set and storage medium
A data set and data set technology, applied in the field of image classification, can solve the problems of reducing the size of the data set, worsening the performance of the training classifier, increasing the cost of data processing, etc., to reduce the error level, improve the generalization performance, reduce the The effect of error rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] Such as figure 1 As shown, the present invention provides a method for improving the quality of classification learning datasets, comprising the steps of:
[0041] S1. Using a pre-designed update method to update the data set;
[0042] S2. In response to detecting that the proportion of clean labels in the data set does not increase, output the data set;
[0043] S3. In response to detecting that the proportion of clean labels in the data set increases, update the data set again using a pre-designed update method;
[0044] Among them, the pre-designed update methods include:
[0045] Obtain the error transition probability matrix of the label through the network output of the anchor sample;
[0046] Obtain the error rate and weight of the label according to the error transition probability matrix of the label, and obtain the weighted average error rate of the data set according to the error rate and weight of the label;
[0047] The data samples are sorted according...
Embodiment 2
[0061] The embodiment of the present invention also provides a system for improving the quality of classification learning data sets, including:
[0062] Update module: used to update the dataset using a pre-designed update method;
[0063] Output module: used to output the data set in response to detecting that the proportion of clean labels in the data set does not increase;
[0064] Re-update module: used to update the data set again by using a pre-designed update method in response to detecting an increase in the proportion of clean labels in the data set.
Embodiment 3
[0066] The embodiment of the present invention also provides a device for improving the quality of the classification learning data set, including a processor and a storage medium;
[0067] The storage medium is used to store instructions;
[0068] The processor is operable in accordance with the instructions to perform the steps according to the following method:
[0069] S1. Using a pre-designed update method to update the data set;
[0070] S2. In response to detecting that the proportion of clean labels in the data set does not increase, output the data set;
[0071] S3. In response to detecting that the proportion of clean labels in the data set increases, update the data set again using a pre-designed update method;
[0072] Among them, the pre-designed update methods include:
[0073] Obtain the error transition probability matrix of the label through the network output of the anchor sample;
[0074] Obtain the error rate and weight of the label according to the err...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com