Voice marking quality determination method and device, equipment and computer-readable medium

A technology of voice annotation and determination method, applied in the computer field to achieve the effect of improving efficiency

Inactive Publication Date: 2019-07-30
北京晴数智慧科技有限公司
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Quality inspection often adopts the method of randomly extracting labeler data, which is relatively random and may miss poor-quality labeling data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice marking quality determination method and device, equipment and computer-readable medium
  • Voice marking quality determination method and device, equipment and computer-readable medium
  • Voice marking quality determination method and device, equipment and computer-readable medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. Based on the embodiments in the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present application.

[0048] At present, voice data acquisition is often marked manually, and then the marked data is qualified and accepted through quality inspection. The quality of data marked by different annotators will be uneven, and quality inspectors need to check the quality of the data again. The more accurate the data obtained after quality inspec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a voice marking quality determination method and device, equipment and a computer-readable medium. The method comprises the steps of inputting a target audio file into a preset voice recognition model to obtain pre-recognized texts and a Bayesian risk value of the pre-recognized texts; acquiring marking process information of a marking worker in a marking process of the pre-recognized texts and historical marking information of the marking worker during marking of historical marked texts; on the basis of the Bayesian risk value, the marking process information and thehistorical marking information, determining the text reliability of marked texts obtained after the marking worker marks the pre-recognized texts; determining the marking quality of the marked texts according to the text credibility. According to the voice marking quality determination method and device, the equipment and the computer-readable medium, inspectors can be helped to pay attention to the marked texts which are more likely to be wrong, and therefore the efficiency of the whole quality detection of voice data marking is improved.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a method, device, equipment and computer-readable medium for determining the quality of speech annotation. Background technique [0002] At present, with the breakthrough of artificial intelligence technology, voice, as an important part of human-computer interaction, is becoming more and more prominent. However, due to the large differences in the corresponding speech in different regions, in order to establish an effective acoustic model, it is necessary to label a large amount of speech data. [0003] At present, voice data acquisition is often marked manually, and then the marked data is qualified and accepted through quality inspection. The quality of data marked by different annotators will be uneven, and quality inspectors need to check the quality of the data again. The more accurate the data obtained after quality inspection and acceptance, the better the ef...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/01G10L15/26G10L15/06G06K9/62G06F16/35
CPCG10L15/01G10L15/26G10L15/063G06F16/35G06F18/24155
Inventor 张晴晴何淑琳刘天宇杨金富罗磊马光谦汪洋
Owner 北京晴数智慧科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products