Voice annotation quality determination method, device, equipment and computer readable medium

A technology of voice annotation and determination method, which is applied in the computer field to achieve the effect of improving efficiency

Active Publication Date: 2021-12-17
BEIJING AISHU WISDOM TECH CO LTD
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Quality inspection often adopts the method of randomly extracting labeler data, which is relatively random and may miss poor-quality labeling data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice annotation quality determination method, device, equipment and computer readable medium
  • Voice annotation quality determination method, device, equipment and computer readable medium
  • Voice annotation quality determination method, device, equipment and computer readable medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. Based on the embodiments in the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present application.

[0047] At present, voice data acquisition is often marked manually, and then the marked data is qualified and accepted through quality inspection. The quality of data marked by different annotators will be uneven, and quality inspectors need to check the quality of the data again. The more accurate the data obtained after quality inspec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application relates to a method, device, equipment and computer-readable medium for determining the quality of speech annotation. The method includes inputting the target audio file into a preset speech recognition model to obtain the pre-recognized text and the Bayesian risk value of the pre-recognized text; obtaining the tagging process of the tagger during the tagging process of the pre-recognized text Information and the historical annotation information of the annotator when annotating the historical annotation text; based on the Bayesian risk value, the annotation process information and the historical annotation information, it is determined that the annotator marks the pre-identified text. The text credibility of the labeled text; determine the labeling quality of the labeled text according to the text credibility. The application can assist the acceptance checker to pay attention to the marked text that is more likely to make mistakes, thereby improving the efficiency of the quality inspection of the entire voice data mark.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a method, device, equipment and computer-readable medium for determining the quality of speech annotation. Background technique [0002] At present, with the breakthrough of artificial intelligence technology, voice, as an important part of human-computer interaction, is becoming more and more prominent. However, due to the large differences in the corresponding speech in different regions, in order to establish an effective acoustic model, it is necessary to label a large amount of speech data. [0003] At present, voice data acquisition is often marked manually, and then the marked data is qualified and accepted through quality inspection. The quality of data marked by different annotators will be uneven, and quality inspectors need to check the quality of the data again. The more accurate the data obtained after quality inspection and acceptance, the better the ef...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/01G10L15/26G10L15/06G06K9/62G06F16/35
CPCG10L15/01G10L15/26G10L15/063G06F16/35G06F18/24155
Inventor 张晴晴何淑琳刘天宇杨金富罗磊马光谦汪洋
Owner BEIJING AISHU WISDOM TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products