Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice data labeling method and system, electronic equipment and storage medium

A voice data and voice technology, applied in the field of voice data processing, can solve problems such as large data volume and difficult to meet voice production needs

Active Publication Date: 2021-04-02
北京智慧星光信息技术有限公司
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The demand for voice annotation data has four characteristics: large data volume, high annotation quality, multiple scenarios, and multiple languages. The traditional purely manual voice data annotation method is difficult to meet the current voice production needs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice data labeling method and system, electronic equipment and storage medium
  • Voice data labeling method and system, electronic equipment and storage medium
  • Voice data labeling method and system, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052] The method in this embodiment is mainly applied in the production of industrial voice data labeling, and the customized voice data labeling preprocessing process. In the production of industrial voice annotation data, the data annotation speed is slow, the labor cost is large, and the data quality is difficult to guarantee. The present invention adopts voice technology means, through voice time information, text content, speech rate and other information, to quickly process the voice data to be marked, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice data labeling method and system, electronic equipment and a storage medium, and the method comprises the steps: firstly screening original voice data, and carrying outthe reading text matching of screened voice, and obtaining proofreading voice and proofreading text; performing word segmentation on the proofreading text to obtain a word segmentation text; performing noise reduction on the proofreading voice to obtain noise reduction voice, and inputting the voice features after feature extraction into a VAD model to obtain VAD effective voice duration of the noise reduction voice; carrying out voice forced alignment on the word segmentation text by adopting an acoustic model to obtain the word-level alignment time, word-level time intervals, segmented texts, the segmented text starting time, the ending time and the text alignment time; determining a speech speed, an effective time ratio and an error word number according to the plurality of times, and performing speech quality inspection; and segmenting the original voice according to the starting time and the ending time of the segmented text, and taking the segmented text and the segmented voice as voice annotation results. The voice annotation text with qualified quality can be automatically acquired.

Description

technical field [0001] The invention relates to the field of voice data processing, in particular to a voice data labeling method, system, electronic equipment and storage medium. Background technique [0002] With the rapid development of speech technology, the demand for reliable and high-quality speech annotation data required for model training is increasing. Especially in the field of speech recognition, it is difficult to obtain a large amount of reliable annotation data in a short time to quickly build a model. The demand for voice annotation data has four characteristics: large data volume, high annotation quality, multiple scenarios, and multiple languages. The traditional purely manual voice data annotation method is difficult to meet the current voice production needs. Therefore, how to automatically obtain voice-annotated text and ensure the quality of the voice-annotated text has become an urgent problem to be solved. Contents of the invention [0003] In vie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/60G10L25/87G10L21/02
CPCG10L25/60G10L25/87
Inventor 张旺
Owner 北京智慧星光信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products