A Tibetan speech corpus labeling method and system based on collaborative batch active learning

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of voice tagging and corpus tagging, applied in the field of speech recognition and corpus training, to speed up the construction and improve the quality of tagging

Active Publication Date: 2020-12-11

MINZU UNIVERSITY OF CHINA

View PDF13 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Solve the construction of sample evaluation function and the proof of its submodularity property by approaching the optimal batch sample selection method

Through the collaborative labeling method of the labeling committee, the construction of the labeling decision function, the modeling of the labeler evaluation model and the labeler-assisted learning model are solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0036] The principles of the disclosure will now be described with reference to some example embodiments. It can be understood that these embodiments are described only for the purpose of illustrating and helping those skilled in the art to understand and implement the present disclosure, rather than suggesting any limitation to the scope of the present disclosure. The disclosure described herein may be implemented in various ways other than those described below.

[0037] As used herein, the term "comprising" and its variations may be understood as open-ended terms meaning "including but not limited to". The term "based on" may be understood as "based at least in part on". The term "one embodiment" can be read as "at least one embodiment". The term "another embodiment" may be understood as "at least one other embodiment".

[0038] In this application, the Tibetan continuous speech corpus collected, including but not limited to news broadcast corpus and spoken dialogue corp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a Tibetan speech corpus tagging method and system based on cooperative batch active learning, wherein the system includes: a sample selection module, a manual tagging module, a tagging decision module, a tagger evaluation module, and a training set generation module. The invention solves the construction of the sample evaluation function and the proof of the nature of the submodular function through the approach to the optimal batch sample selection method, and solves the construction of the labeling decision function, the construction of the labeler evaluation model and the labeler-assisted learning model through the labeling committee collaborative labeling method mold. In addition, the system of the present invention can realize functions such as optimal selection of samples, annotation evaluation of users, sharing of annotation information and Tibetan phonetic knowledge, assisted learning by annotators, and is intended to improve the tagging quality of Tibetan speech data and speed up the construction of audio corpus.

Description

technical field [0001] The invention relates to the fields of speech recognition and corpus training, in particular to a method and system for labeling Tibetan speech corpus based on cooperative batch active learning. Background technique [0002] In the field of speech recognition, traditional speech recognition algorithms (such as HMM, DBNs, ANN, and DTW, etc.) use a supervised learning method to establish a speech recognition model. In order to establish a speech recognition model with high accuracy, this learning method requires a large number of Annotating speech corpus, and annotating speech corpus is an extremely time-consuming and labor-intensive task. Usually, the time spent on tagging words as a speech recognition unit is 10 times the actual audio sentence time (for example, a one-minute speech sentence takes nearly 10 minutes to tag), and the speech tagging work using phonemes as a recognition unit It will reach 400 times the length of the speech sentence (assumi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/06G10L15/14G06K9/62G06N7/00

CPCG10L15/063G10L15/144G10L2015/0631G06N7/01G06F18/24G06F18/214

Inventor赵悦徐晓娜李要嫱裴欢欢

OwnerMINZU UNIVERSITY OF CHINA

A Tibetan speech corpus labeling method and system based on collaborative batch active learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology