Supercharge Your Innovation With Domain-Expert AI Agents!

Text data detection method and device, electronic equipment and storage medium

A text data and detection method technology, applied in the computer field, can solve the problem of high cost of text data, achieve the effect of low cost and reduce labor cost

Active Publication Date: 2021-09-17
PING AN TECH (SHENZHEN) CO LTD
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problem in the prior art that the cost of incorrectly marked text data due to human inspection is too high, the present application provides a text data detection method, device, electronic equipment and storage medium

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text data detection method and device, electronic equipment and storage medium
  • Text data detection method and device, electronic equipment and storage medium
  • Text data detection method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals denote the same or similar parts in the drawings, and thus their repeated descriptions will be omitted.

[0035]Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of embodiments of the present disclosure. However, those skilled in the art will appreciate that the technical solutions of the present disclosure may be practiced without one or mor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of computers, in particular to a text data detection method and device, electronic equipment and a storage medium. The method comprises: obtaining a sample data set; inputting the text data sample into a pre-trained text classification model to obtain classification probabilities of a plurality of preset text categories, and determining a first probability statistical value of the text data sample according to the classification probabilities; selecting a part of text data samples from the sample data set, and uniformly replacing sample tags of the part of text data samples with alternative sample tags; inputting the text data samples with the alternative sample labels into a text classification model to obtain classification probabilities of a plurality of text categories, and determining second probability statistical values of the text data samples with the alternative sample labels according to the classification probabilities; and determining whether the sample label of the text data sample is correctly labeled or not according to the numerical relationship between the first probability statistical value and the second probability statistical value. By adopting the method provided by the invention, the accuracy of the sample labels of the text data samples can be improved.

Description

technical field [0001] The present application relates to the field of computer technology, and in particular to a text data detection method, device, electronic equipment and storage medium. Background technique [0002] Text classification is a very important module in text processing and is widely used. The quality of labeled data in text classification is very important, and it is related to the actual effect of the classification model. In actual training, a large amount of labeled data is required. If incorrectly labeled data is used to participate in classification model training, the classification accuracy rate will decrease, and classification errors will increase, which will affect the overall classification performance. [0003] General data labeling is done by man-made design rules, and then by human-operated computers. But sometimes the data set is very large, and the data formulated for some special situations or scenarios is special or important, and needs ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06K9/62
CPCG06F16/353G06F18/214
Inventor 司世景王健宗
Owner PING AN TECH (SHENZHEN) CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More