
Data processing method, device, equipment and medium

A data processing technique, applied in the field of artificial intelligence, that solves the problem that labeled data cannot be quality-inspected automatically. It reduces the workload of quality inspectors and makes labeling errors easy to trace and locate.

Pending Publication Date: 2021-04-16
BEIJING ORION STAR TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] The embodiments of the present invention provide a data processing method, device, equipment, and medium to solve the problem that the quality of labeled data cannot be inspected by electronic equipment.


Examples


Embodiment 1

[0027] Embodiment 1: Figure 1 is a schematic diagram of a data processing process provided by an embodiment of the present invention. The process includes:

[0028] S101: Obtain any labeled data to be quality-inspected and the audio data corresponding to that labeled data, where the labeled data includes the text data corresponding to the audio data and a first text feature of that text data.

[0029] The data processing method provided by the embodiment of the present invention is applied to an electronic device, and the electronic device may be a smart device such as a robot or a server.

[0030] In the embodiment of the present invention, the labeled data corresponding to the audio data includes the text data corresponding to the audio data and the text feature of that text data (referred to as the first text feature for convenience of description). The first text feature includes the sequence of consonants (initials) and finals corresponding to the text data. The sequence of consonants and finals will contain the consonants and finals correspondi...
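As an illustration of the input described in S101, the sketch below models one labeled item as a small record holding the audio reference, the text data, and the first text feature. The class name `LabeledSample`, its field names, the helper `is_well_formed`, and the sample values are all hypothetical; the source only states that the labeled data contains text data and a sequence of consonants and finals.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LabeledSample:
    """One labeled item to be quality-inspected (hypothetical container)."""
    audio_path: str                # reference to the corresponding audio data
    text: str                      # text data corresponding to the audio
    first_text_feature: List[str]  # sequence of consonants and finals for the text


def is_well_formed(sample: LabeledSample) -> bool:
    """Structural check only: text and feature sequence are present and non-blank.

    This is an assumed pre-check, not the model-based quality inspection
    that the patent describes.
    """
    return (
        bool(sample.text)
        and bool(sample.first_text_feature)
        and all(unit.strip() for unit in sample.first_text_feature)
    )


# Illustrative values only; the file name and the feature units are made up.
sample = LabeledSample("utt_0001.wav", "你好", ["n", "i", "h", "ao"])
print(is_well_formed(sample))
```

A record like this keeps the text and its feature sequence together, so a later inspection step can report errors against a single identifiable item.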

Embodiment 2

[0039] Embodiment 2: To improve the accuracy and efficiency of quality inspection of the labeled data, on the basis of the above embodiments, in this embodiment of the present invention, judging whether the labeled data is correct according to the quality inspection data corresponding to the labeled data includes:

[0040] If it is determined that the quality inspection data corresponding to the labeled data meets the pre-configured quality inspection requirements, it is determined that the labeled data is correct; or

[0041] If it is determined that the quality inspection data corresponding to the labeled data does not meet the pre-configured quality inspection requirements, it is determined that the labeled data is labeled incorrectly.

[0042] Generally, each character in correctly labeled data corresponds to at least one audio frame in the audio data corresponding to the labeled data, and the last charac...
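The property stated in [0042], that every character in correctly labeled data corresponds to at least one audio frame, can be sketched as a simple check. The representation used here is an assumption: the quality inspection data is modeled as a frames-by-characters weight matrix, and each frame is assigned to its highest-weight character. The function names are hypothetical, and the condition on the last character is truncated in the source, so it is not implemented.

```python
from typing import List


def frames_to_characters(alignment: List[List[float]]) -> List[int]:
    """For each audio frame (row), return the index of the character (column)
    with the largest weight. Rows are assumed to be non-empty."""
    return [max(range(len(row)), key=row.__getitem__) for row in alignment]


def uncovered_characters(alignment: List[List[float]], num_chars: int) -> List[int]:
    """Return positions of characters that no audio frame is assigned to."""
    covered = set(frames_to_characters(alignment))
    return [i for i in range(num_chars) if i not in covered]


def label_is_correct(alignment: List[List[float]], num_chars: int) -> bool:
    """Assumed requirement: every character has at least one aligned frame."""
    return not uncovered_characters(alignment, num_chars)


# Three frames, three characters; the weights are made-up illustration values.
good = [[0.9, 0.1, 0.0], [0.2, 0.7, 0.1], [0.0, 0.3, 0.7]]
bad = [[0.9, 0.1, 0.0], [0.8, 0.1, 0.1], [0.1, 0.1, 0.8]]
print(label_is_correct(good, 3), uncovered_characters(bad, 3))
```

In the second matrix no frame is assigned to the character at position 1, which is the kind of mismatch that would mark the labeled data as incorrect under this assumed requirement.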

Embodiment 3

[0086] Embodiment 3: To make it easier for staff to correct incorrectly labeled data, on the basis of the above embodiments, in this embodiment of the present invention, after it is determined that the quality inspection data corresponding to the labeled data does not meet the pre-configured quality inspection requirements, the method further includes: outputting a prompt message indicating that the labeled data is wrong.

[0087] In an actual application scenario, when it is determined that the quality inspection data corresponding to a piece of labeled data does not meet the pre-configured quality inspection requirements, the labeled data is determined to be wrongly labeled, and it needs to be modified according to its corresponding audio data so that it becomes correct. After the corrected labeled data is re-input into the speech synthesis model, the obtained quality inspection data corresponding t...
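The prompt step in this embodiment could look like the sketch below. The message format, the function name `build_prompt`, and the idea of listing failing character positions are assumptions made for illustration; the source only states that a prompt message indicating that the labeled data is wrong is output.

```python
from typing import List


def build_prompt(sample_id: str, text: str, failing_positions: List[int]) -> str:
    """Compose a message telling staff whether a labeled item failed inspection.

    `failing_positions` is a hypothetical list of character indices that did
    not meet the quality inspection requirements.
    """
    if not failing_positions:
        return f"{sample_id}: labeled data passed quality inspection"
    details = ", ".join(f"{text[i]!r} (position {i})" for i in failing_positions)
    return f"{sample_id}: labeled data is wrong; check characters: {details}"


# Illustrative identifiers and positions only.
print(build_prompt("utt_0001", "你好", [1]))
print(build_prompt("utt_0002", "你好", []))
```

Including the item identifier and character positions in the message is one way to support tracing and locating the labeling error before the corrected data is inspected again.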


Abstract

The invention discloses a data processing method, device, equipment, and medium that solve the problem that labeled data cannot be quality-inspected by electronic equipment. During quality inspection, the quality inspection data corresponding to the labeled data is obtained through a speech synthesis model, based on the labeled data to be inspected and the audio data corresponding to it. Because the quality inspection data represents the correspondence between each character in the labeled data and each audio frame in the corresponding audio data, whether the labeled data is correct can be determined from that quality inspection data. The labeled data therefore does not need to be inspected manually, which reduces the workload of quality inspection personnel, reduces the influence of their working ability on inspection efficiency and accuracy, and makes wrongly labeled data easy to trace and locate.

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, in particular to a data processing method, device, equipment and medium.

Background

[0002] In the prior art, text information is generally converted into speech information based on a speech synthesis model. In order to obtain a speech synthesis model, a large number of speech samples and corresponding label data for each speech sample are generally required to train the original speech synthesis model. Under the same model structure, a high-precision speech synthesis model can be trained based on a large number of high-quality training speech samples and the labeled data corresponding to each training speech sample. In the process of testing the trained speech synthesis model, test the number of speech samples, the balance between the sample training set and the sample test set, and the quality of the labeled data corresponding to the test speech samples, and...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/04, G10L13/08
CPC: Y02P90/30
Inventors: 李旭, 刘欢
Owner: BEIJING ORION STAR TECH CO LTD