Supercharge Your Innovation With Domain-Expert AI Agents!

Data set quality evaluation method and device, electronic equipment and storage medium

A technology of quality assessment and data collection, applied in the field of data processing, can solve the problems of lack of accuracy and completeness evaluation of manual review, and achieve the effect of solving lack of accuracy and improving accuracy and completeness

Pending Publication Date: 2022-05-06
CHINA TELECOM CORP LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The disclosure provides a data set quality assessment method and device, electronic equipment, and storage media, which overcome the problem of lack of accuracy and completeness assessment of data set quality due to manual review at least to a certain extent in related technologies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data set quality evaluation method and device, electronic equipment and storage medium
  • Data set quality evaluation method and device, electronic equipment and storage medium
  • Data set quality evaluation method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete and will fully convey the concept of example embodiments to those skilled in the art. The described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

[0035] Furthermore, the drawings are merely schematic illustrations of the present disclosure and are not necessarily drawn to scale. The same reference numerals in the drawings denote the same or similar parts, and thus repeated descriptions thereof will be omitted. Some of the block diagrams shown in the drawings are functional entities and do not necessarily correspond to physically or logically separate entities. These functional entities ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a data set quality evaluation method and device, electronic equipment and a storage medium, and relates to the technical field of data processing. The method comprises the following steps: inputting a to-be-evaluated data set into a pre-trained baseline model, and calculating the model accuracy of the baseline model on the to-be-evaluated data set; judging whether the model accuracy is greater than a preset threshold; if yes, the to-be-evaluated data set is classified according to the output result of the baseline model, the quality evaluation result of the to-be-evaluated data set is determined according to the classification result, and the preset quality evaluation indexes comprise one or more indexes for performing quality evaluation on the to-be-evaluated data set; and if not, determining a quality assessment result of the to-be-assessed data set according to the model accuracy and the generalization ability parameter of the baseline model. According to the method, the accuracy of the model is judged and calculated differently, so that the accuracy and completeness of data set quality evaluation are improved.

Description

technical field [0001] The present disclosure relates to the technical field of data processing, and in particular to a data set quality assessment method and device, electronic equipment, and a storage medium. Background technique [0002] As one of the key technologies of artificial intelligence, deep learning has three core elements: big data, deep learning algorithm design and high-performance computing platform. Among them, big data is the foundation of the current development of artificial intelligence. In the field of supervised learning of classification problems, big data is embodied as a training data set with classification labels. The quality of the training dataset directly affects the performance of the predictive model. [0003] Data set quality assessment needs to consider factors such as completeness, accuracy, and balance. It should also consider the needs of the data set to meet the application scenario, that is, the completeness of the description of the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06N3/08
CPCG06N3/08G06F18/217G06F18/214G06F18/24
Inventor 汪少敏
Owner CHINA TELECOM CORP LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More