Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Tibetan speech corpus labeling method and system based on collaborative batch active learning

A technology of voice tagging and corpus tagging, applied in the field of speech recognition and corpus training, to speed up the construction and improve the quality of tagging

Active Publication Date: 2020-12-11
MINZU UNIVERSITY OF CHINA
View PDF13 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Solve the construction of sample evaluation function and the proof of its submodularity property by approaching the optimal batch sample selection method
Through the collaborative labeling method of the labeling committee, the construction of the labeling decision function, the modeling of the labeler evaluation model and the labeler-assisted learning model are solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Tibetan speech corpus labeling method and system based on collaborative batch active learning
  • A Tibetan speech corpus labeling method and system based on collaborative batch active learning
  • A Tibetan speech corpus labeling method and system based on collaborative batch active learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The principles of the disclosure will now be described with reference to some example embodiments. It can be understood that these embodiments are described only for the purpose of illustrating and helping those skilled in the art to understand and implement the present disclosure, rather than suggesting any limitation to the scope of the present disclosure. The disclosure described herein may be implemented in various ways other than those described below.

[0037] As used herein, the term "comprising" and its variations may be understood as open-ended terms meaning "including but not limited to". The term "based on" may be understood as "based at least in part on". The term "one embodiment" can be read as "at least one embodiment". The term "another embodiment" may be understood as "at least one other embodiment".

[0038] In this application, the Tibetan continuous speech corpus collected, including but not limited to news broadcast corpus and spoken dialogue corp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Tibetan speech corpus tagging method and system based on cooperative batch active learning, wherein the system includes: a sample selection module, a manual tagging module, a tagging decision module, a tagger evaluation module, and a training set generation module. The invention solves the construction of the sample evaluation function and the proof of the nature of the submodular function through the approach to the optimal batch sample selection method, and solves the construction of the labeling decision function, the construction of the labeler evaluation model and the labeler-assisted learning model through the labeling committee collaborative labeling method mold. In addition, the system of the present invention can realize functions such as optimal selection of samples, annotation evaluation of users, sharing of annotation information and Tibetan phonetic knowledge, assisted learning by annotators, and is intended to improve the tagging quality of Tibetan speech data and speed up the construction of audio corpus.

Description

technical field [0001] The invention relates to the fields of speech recognition and corpus training, in particular to a method and system for labeling Tibetan speech corpus based on cooperative batch active learning. Background technique [0002] In the field of speech recognition, traditional speech recognition algorithms (such as HMM, DBNs, ANN, and DTW, etc.) use a supervised learning method to establish a speech recognition model. In order to establish a speech recognition model with high accuracy, this learning method requires a large number of Annotating speech corpus, and annotating speech corpus is an extremely time-consuming and labor-intensive task. Usually, the time spent on tagging words as a speech recognition unit is 10 times the actual audio sentence time (for example, a one-minute speech sentence takes nearly 10 minutes to tag), and the speech tagging work using phonemes as a recognition unit It will reach 400 times the length of the speech sentence (assumi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/14G06K9/62G06N7/00
CPCG10L15/063G10L15/144G10L2015/0631G06N7/01G06F18/24G06F18/214
Inventor 赵悦徐晓娜李要嫱裴欢欢
Owner MINZU UNIVERSITY OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products