Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice data screening method, device, electronic equipment and storage medium

A voice data and screening method technology, applied in the field of information processing, can solve the problems of rising costs and high costs, and achieve the effect of improving efficiency and avoiding excessive labor costs

Active Publication Date: 2022-05-13
AISPEECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the above method, manual labeling of the full amount of data will lead to an increase in cost, because the voice data collected in the real environment has a high proportion of invalid data, and in the later processing of screening invalid data , and mainly rely on the results of manual labeling, therefore, the above scheme has the problem of high cost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice data screening method, device, electronic equipment and storage medium
  • Voice data screening method, device, electronic equipment and storage medium
  • Voice data screening method, device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purpose, features, and advantages of the application more obvious and understandable, the technical solutions in the embodiments of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the application. Obviously, the described The embodiments are only some of the embodiments of the present application, but not all of them. Based on the embodiments in this application, all other embodiments obtained by those skilled in the art without making creative efforts belong to the scope of protection of this application.

[0026] The embodiment of the present invention provides a screening method for voice data, such as figure 1 shown, including:

[0027] S11: Obtain N voice data; wherein, N is an integer greater than or equal to 2;

[0028] S12: Using a plurality of speech recognition engines to recognize each speech data in the N speech data, and obtain a plurality of recognitio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a voice data screening method, device, electronic equipment, and storage medium, wherein the method includes: acquiring N voice data; wherein, N is an integer greater than or equal to 2; using multiple voice recognition engines to Recognize each voice data in the N voice data, and obtain multiple recognition results corresponding to each voice data; wherein, among the multiple voice recognition engines, the acoustic models contained in different voice recognition engines are different , and / or different speech recognition engines contain different language models; perform speech data screening based on multiple recognition results corresponding to each speech data, and obtain the first type of invalid data in the N speech data; The remaining voice data obtained after deleting the invalid data of the first type among the N pieces of voice data is used as valid data, and the target model is modeled based on the valid data.

Description

technical field [0001] The present application relates to the field of information processing, in particular to a screening method, device, electronic equipment and storage medium for voice data. Background technique [0002] In related technologies, for the processing of speech data, the full amount of speech data is generally manually labeled first, and then the invalid data is screened out through manual review or machine learning after the labeling is completed. Specifically, it may include comparing the original manual annotation with the manual annotation generated during the review process or the automatic annotation generated based on the machine learning method, and filtering out invalid data through the degree of difference in the annotation. [0003] However, in the above method, manual labeling of the full amount of data will lead to an increase in cost, because the voice data collected in the real environment has a high proportion of invalid data, and in the lat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/10G10L15/18G10L15/32
CPCG10L15/063G10L15/10G10L15/18G10L15/32G10L2015/0631
Inventor 薛峰
Owner AISPEECH CO LTD