Tibetan speech corpus labeling method and system based on cooperative batch active learning

A speech and corpus annotation technique, applied in the fields of corpus training and speech recognition, that improves annotation quality and speeds up corpus construction.

Active Publication Date: 2018-03-16
MINZU UNIVERSITY OF CHINA

AI Technical Summary

Problems solved by technology

The near-optimal batch sample selection method addresses the construction of a sample evaluation function and the proof of its submodularity.
Through the collaborative la...

Examples

Embodiment Construction

[0036] The principles of the present disclosure will now be described with reference to some example embodiments. It can be understood that these embodiments are only described for the purpose of illustration and to help those skilled in the art understand and implement the present disclosure, and do not suggest any limitation on the scope of the present disclosure. The content of the present disclosure described here can be implemented in various ways other than those described below.

[0037] As described herein, the term "including" and its various variants can be understood as open-ended terms, which means "including but not limited to." The term "based on" can be understood as "based at least in part on." The term "one embodiment" may be understood as "at least one embodiment." The term "another embodiment" may be understood as "at least one other embodiment."

[0038] In this application, the collected Tibetan continuous speech corpus, including but not limited to news bro...

Abstract

The invention discloses a Tibetan speech corpus labeling method and system based on cooperative batch active learning. The system comprises a sample selection module, a manual labeling module, a labeling decision-making module, an annotator evaluation module, and a training set generation module. The near-optimal batch sample selection method addresses the construction of a sample evaluation function and the proof that this function is submodular. The labeling committee collaborative labeling method addresses the construction of a labeling decision function and the modeling of an annotator evaluation model and an annotator auxiliary learning model. In addition, the disclosed system provides functions including optimal sample selection, user labeling and evaluation, sharing of labeling information and Tibetan speech knowledge, and auxiliary learning for annotators, so that the labeling quality of Tibetan speech data is improved and the construction of the speech corpus is accelerated.
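The abstract does not give the sample evaluation function itself, so the following is only a minimal sketch of how batch selection with a submodular score can work. It assumes a facility-location coverage score over an RBF similarity matrix, which is a standard monotone submodular function but is not stated in the source; the names `rbf_similarity`, `coverage`, and `greedy_batch` are hypothetical. For monotone submodular scores under a batch-size limit, greedy selection of this kind is known to reach at least a (1 - 1/e) fraction of the optimal score, which is the usual sense of "near-optimal".

```python
import math


def rbf_similarity(points):
    """Pairwise similarity exp(-squared distance); an assumed stand-in for
    whatever acoustic similarity the real system would use."""
    return [
        [math.exp(-sum((a - b) ** 2 for a, b in zip(p, q))) for q in points]
        for p in points
    ]


def coverage(similarity, selected):
    """Facility-location score: for every sample, take its similarity to the
    closest selected sample, then sum over all samples."""
    if not selected:
        return 0.0
    return sum(max(row[j] for j in selected) for row in similarity)


def greedy_batch(similarity, batch_size):
    """Pick `batch_size` samples, each time adding the one with the largest
    marginal gain in the coverage score."""
    selected = []
    remaining = set(range(len(similarity)))
    while remaining and len(selected) < batch_size:
        base = coverage(similarity, selected)
        best = max(
            sorted(remaining),
            key=lambda j: coverage(similarity, selected + [j]) - base,
        )
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy features: two tight clusters of two utterances each (hypothetical data).
points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
batch = greedy_batch(rbf_similarity(points), batch_size=2)
```

On the toy data the batch contains one sample from each cluster, because a second sample from an already covered cluster adds almost nothing to the score. This diminishing-returns behavior is what the submodularity property expresses.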

Description

Technical field

[0001] The invention relates to the field of speech recognition and corpus training, and in particular to a method and system for labeling a Tibetan speech corpus based on collaborative batch active learning.

Background technique

[0002] In the field of speech recognition, traditional speech recognition algorithms (such as HMM, DBNs, ANN, DTW, etc.) use supervised learning to build speech recognition models. To build high-accuracy models, this approach requires a large amount of annotated speech corpus, and annotating a speech corpus is an extremely time-consuming and laborious task. Labeling with words as the recognition unit usually takes about 10 times the duration of the audio itself (for example, labeling a one-minute utterance takes close to 10 minutes), and labeling with phonemes as the recognition unit will reach 400 times the length of the speech sentenc...
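To make the labeling cost concrete, the short sketch below applies the two multipliers quoted in paragraph [0002] (about 10 times real time for word-level labels, about 400 times for phoneme-level labels). The 10-hour corpus size and the function name `labeling_hours` are hypothetical and chosen only for illustration.

```python
# Multipliers taken from paragraph [0002]; both are rough figures.
WORD_LEVEL_FACTOR = 10
PHONEME_LEVEL_FACTOR = 400


def labeling_hours(audio_hours, factor):
    """Estimated manual labeling time for a given amount of audio."""
    return audio_hours * factor


corpus_hours = 10  # hypothetical corpus size, not from the source
word_cost = labeling_hours(corpus_hours, WORD_LEVEL_FACTOR)
phoneme_cost = labeling_hours(corpus_hours, PHONEME_LEVEL_FACTOR)
print(f"word-level: {word_cost} h, phoneme-level: {phoneme_cost} h")
```

Under these assumptions, 10 hours of audio would need roughly 100 hours of word-level labeling or 4000 hours of phoneme-level labeling, which is the cost that selecting fewer, more informative samples is meant to reduce.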

Application Information

IPC(8): G10L15/06, G10L15/14, G06K9/62, G06N7/00
CPC: G10L15/063, G10L15/144, G10L2015/0631, G06N7/01, G06F18/24, G06F18/214
Inventor: 赵悦, 徐晓娜, 李要嫱, 裴欢欢
Owner: MINZU UNIVERSITY OF CHINA