Supercharge Your Innovation With Domain-Expert AI Agents!

Sample processing method and device, equipment and storage medium

A processing method and sample technology, applied in the field of data processing, can solve the problems of unbalanced samples and low model accuracy, and achieve the effect of reducing the number of samples and improving the accuracy

Pending Publication Date: 2022-04-12
AGRICULTURAL BANK OF CHINA
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] For the field of machine learning, if there is a problem of unbalanced samples in the sample data set, when the sample data set is used for model training, the output results of the trained model will be biased towards the larger category samples, resulting in the model is less accurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sample processing method and device, equipment and storage medium
  • Sample processing method and device, equipment and storage medium
  • Sample processing method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0049] The terms used in the embodiments of the present application are only for the purpose of describing specific embodiments, and are not intended to limit the present invention. The singular forms "a" and "the" used in the embodiments of the present application are also intended to include plural forms, unless the context clearly indicates oth...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a sample processing method and device, equipment and a storage medium, and aims to further cluster samples (namely, first samples in the application) with a large proportion in a sample data set with a sample imbalance problem. According to the method, category samples with large proportions are further divided into a plurality of clusters composed of similar sample data, so that the sample number of the category samples with large proportions can be greatly reduced to ensure that the number proportion of different categories of samples in the sample data set is a small value, model training is carried out through the processed sample data set, and the model training efficiency is improved. And the accuracy of the model can be improved.

Description

technical field [0001] The present application relates to the technical field of data processing, and in particular to a sample processing method, device, equipment and storage medium. Background technique [0002] Sample imbalance refers to the situation where the distribution of sample data of different categories in a given sample data set is unbalanced. Among them, the first category sample with a large proportion of data and the second category sample with a small proportion reach a large Proportion. [0003] For the field of machine learning, if there is a problem of unbalanced samples in the sample data set, when the sample data set is used for model training, the output results of the trained model will be biased towards the larger category samples, resulting in the model The accuracy is lower. [0004] The above information disclosed in this Background section is only for enhancement of understanding of the background of the application and therefore it may contai...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
Inventor 吴振阳孙岚子任哲丰
Owner AGRICULTURAL BANK OF CHINA
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More