Check patentability & draft patents in minutes with Patsnap Eureka AI!

Voice data annotation method and device and electronic equipment

A voice data and voice technology, applied in the field of voice data labeling, can solve the problems of consuming a lot of human resources and error-prone, and achieve the effect of saving manpower consumption and improving the accuracy of labeling

Pending Publication Date: 2021-05-28
北京捷通数智科技有限公司
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Relying entirely on manual data labeling not only consumes a lot of human resources, but is also error-prone due to manual operations.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice data annotation method and device and electronic equipment
  • Voice data annotation method and device and electronic equipment
  • Voice data annotation method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0049] refer to figure 1 , shows a flow chart of steps of a voice data labeling method according to an embodiment of the present invention.

[0050] The voice data labeling method of the embodiment of the present invention may comprise the following steps:

[0051] Step 101: input the speech data and text into the pre-trained acoustic model, and obtain the posterior probability and alignment result of each frame of speech for all ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice data annotation method and device and electronic equipment. The method comprises the steps: inputting voice data and a text into a pre-trained acoustic model, and acquiring the posterior probability of each frame of voice for all phonemes and an alignment result; determining each first voice frame corresponding to each phoneme according to the alignment result; for each phoneme in the text, determining whether the phoneme is a doubt phoneme according to the posterior probability of the first voice frame for the phoneme; and performing first marking on each doubt phoneme in the text. According to the voice data annotation method provided by the invention, the accuracy of a manual voice data annotation result can be improved, and the manpower consumption of manual voice data annotation can be saved.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice data labeling method and device, and electronic equipment. Background technique [0002] At present, with the breakthrough of artificial intelligence technology, voice, as an important part of human-computer interaction, is becoming more and more prominent. However, due to the large differences in the corresponding voices in different regions, it is necessary to label a large amount of voice data in order to establish an effective acoustic model. [0003] In speech data annotation, text needs to be annotated according to the sound. Generally, a large number of outsourced personnel are used for data labeling, and the marked data is checked and accepted to determine whether the correct rate of labeling meets the standard. Relying entirely on manual data labeling not only consumes a lot of human resources, but also is prone to errors due to manual operations. C...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/183G10L15/02
CPCG10L15/063G10L15/183G10L15/02G10L2015/025
Inventor 肖娜张欢郭佳武卫东
Owner 北京捷通数智科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More