Voice annotation quality determining method and device, equipment and computer readable medium

A voice annotation quality determination technology, applied in the computer field, that improves the efficiency of annotation quality inspection

Active Publication Date: 2019-09-20
北京晴数智慧科技有限公司

AI Technical Summary

Problems solved by technology

Quality inspection usually samples each annotator's data at random. Because the sample is arbitrary, poor-quality annotations may be missed.



Examples


Detailed Description of Embodiments

[0047] To make the purposes, technical solutions, and advantages of the embodiments of the present application clearer, the technical solutions in these embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present application, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present application.

[0048] At present, voice data is usually annotated manually, and the annotated data is then accepted only after it passes quality inspection. Annotation quality varies from one annotator to another, so quality inspectors have to check the data again. The more accurate the data obtained after quality inspec...


Abstract

The invention relates to a voice annotation quality determining method and device, equipment, and a computer readable medium. The method comprises the following steps: a target audio file is input into a preset voice recognition model to obtain a pre-recognition text and a Bayes risk value of the pre-recognition text; annotation process information, recorded while an annotator annotates the pre-recognition text, and history annotation information, recorded when the same annotator annotated earlier texts, are obtained; the text reliability of the annotated text that the annotator produced from the pre-recognition text is determined based on the Bayes risk value, the annotation process information, and the history annotation information; and the annotation quality of the annotated text is determined according to the text reliability. The method, device, equipment, and computer readable medium help inspectors focus on the annotated texts that are more likely to be incorrect, which improves the efficiency of quality inspection for voice data annotation as a whole.
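The steps in the abstract can be sketched in Python. This is a minimal illustration under stated assumptions, not the patented formulas, which are not visible in this record: the Bayes risk is approximated as the expected word-level edit distance of the pre-recognition text against an N-best list, and `process_score`, `historical_accuracy`, the weights, and the 0.6 threshold are all hypothetical placeholders.

```python
from dataclasses import dataclass


def edit_distance(a: list[str], b: list[str]) -> int:
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(b) + 1))
    for i, tok_a in enumerate(a, start=1):
        curr = [i]
        for j, tok_b in enumerate(b, start=1):
            cost = 0 if tok_a == tok_b else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def bayes_risk(best: str, nbest: list[tuple[str, float]]) -> float:
    """Expected normalized edit distance of `best` against an N-best list.

    Assumption: `nbest` holds (hypothesis, posterior) pairs from the
    recognizer; posteriors are renormalized to sum to 1.
    """
    total = sum(p for _, p in nbest)
    ref = best.split()
    risk = 0.0
    for hyp, p in nbest:
        toks = hyp.split()
        denom = max(len(ref), len(toks), 1)
        risk += (p / total) * edit_distance(ref, toks) / denom
    return risk


@dataclass
class AnnotationRecord:
    bayes_risk: float           # risk of the pre-recognition text, in [0, 1]
    process_score: float        # hypothetical summary of annotation process info, in [0, 1]
    historical_accuracy: float  # hypothetical annotator pass rate from history, in [0, 1]


def text_reliability(rec: AnnotationRecord,
                     weights: tuple[float, float, float] = (0.4, 0.2, 0.4)) -> float:
    """Weighted combination of the three signals (weights are assumptions)."""
    w_risk, w_proc, w_hist = weights
    return (w_risk * (1.0 - rec.bayes_risk)
            + w_proc * rec.process_score
            + w_hist * rec.historical_accuracy)


def annotation_quality(reliability: float, threshold: float = 0.6) -> str:
    """Map reliability to a coarse quality label (threshold is an assumption)."""
    return "likely correct" if reliability >= threshold else "needs review"


def review_order(records: list[AnnotationRecord]) -> list[int]:
    """Indices of records sorted so the least reliable text is inspected first."""
    return sorted(range(len(records)), key=lambda i: text_reliability(records[i]))
```

Sorting by ascending reliability, rather than sampling at random, is what lets an inspector reach the annotated texts most likely to be wrong first.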

Description

Technical Field

[0001] The present application relates to the field of computer technology, and in particular to a method, device, equipment, and computer-readable medium for determining the quality of speech annotation.

Background

[0002] With recent breakthroughs in artificial intelligence technology, voice is becoming increasingly prominent as an important part of human-computer interaction. However, because speech differs considerably between regions, a large amount of speech data must be annotated in order to build an effective acoustic model.

[0003] At present, voice data is usually annotated manually, and the annotated data is then accepted only after it passes quality inspection. Annotation quality varies from one annotator to another, so quality inspectors have to check the data again. The more accurate the data obtained after quality inspection and acceptance, the better the ef...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/01, G10L15/26, G10L15/06, G06K9/62, G06F16/35
CPC: G10L15/01, G10L15/26, G10L15/063, G06F16/35, G06F18/24155
Inventor: 张晴晴, 何淑琳, 刘天宇, 杨金富, 罗磊, 马光谦, 汪洋
Owner: 北京晴数智慧科技有限公司