Mongolian large vocabulary continuous speech recognition method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition and large vocabulary technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of sparse language model data, long speech recognition time, and the inability of speech recognition system to contain large-scale Mongolian words, etc. The effect of improving the amount of calculation and system performance

Active Publication Date: 2016-09-21

INNER MONGOLIA UNIVERSITY

View PDF9 Cites 33 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0009] In order to achieve the above object, the present invention provides a method for continuous speech recognition of large Mongolian vocabulary, which solves the problem that the speech recognition system in the prior art cannot contain large-scale Mongolian words, and the time of speech recognition is too long caused by the excessive amount of words , the problem of sparse language model data in speech recognition systems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] The principle of Mongolian segmentation recognition:

[0034] Mongolian is a typical agglutinative language, and Mongolian words are mainly formed by splicing roots and affixes, such as figure 2 shown. From the combination of roots and affixes, it can be seen that the splicing of roots and word-forming affixes or configurational suffixes has actual semantic modification, while the splicing of subsequent and ending suffixes has only grammatical meaning, and its position is always stored in the composition the end of the word. Ending suffixes do not belong to stem suffixes, which include case suffixes of static words, possessive (owner) suffixes, formal verbs (time, person) suffixes and adverb suffixes. As for the verb suffix, if the verb acts as the predicate of the main clause, it can be considered as an ending suffix, but when the verb is used as a static word (especially when it is followed by a case suffix), it can be considered as a stem suffix. In general, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a Mongolian large vocabulary continuous speech recognition method. The method is composed of a preprocessing phase, a preparation phase, a training phase, a decoding phase and a synthesis conversion phase. In the preprocessing phase, text training data is segmented and a pronunciation dictionary is established; in the preparation phase, acoustic features are extracted from input voice signals; in the training phase, an acoustic model is trained by use of a whole-word pronunciation dictionary, and a language model is trained by use of a training text after segmentation; in the decoding phase, by use of the acoustic model, the language model and the pronunciation dictionary, the input acoustic features are recognized into text information; and in the synthesis conversion phase, case suffix errors in a decoding process are corrected by use of rules, stems are merged with case suffixes, and finally, sentences composed of Mongolian words are output. According to the invention, the problems of too long voice recognition time and language model data sparsity in a voice recognition system because the voice recognition system cannot include large-scale Mongolian words and the vocabulary is too large in the prior art are solved.

Description

technical field [0001] The invention belongs to the technical field of speech recognition and relates to a method for continuous speech recognition of Mongolian large vocabulary. Background technique [0002] Speech recognition is a key technology to realize human-machine voice communication. It involves multiple disciplines such as acoustics, linguistics, digital signal processing, and computer science. It is a cutting-edge technology in the field of information processing. The main problem to be solved is how to Received voice messages are converted to text messages. According to different task requirements, speech recognition can be divided into several types: speaker recognition, keyword detection and continuous speech recognition. At present, it has been successfully applied to various fields such as industry, home appliances, communications, automotive electronics, medical care, home services and consumer electronics, and has achieved very good results. [0003] The ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/02G10L15/18

CPCG10L15/02G10L15/18

Inventor飞龙高光来张红伟

OwnerINNER MONGOLIA UNIVERSITY

Mongolian large vocabulary continuous speech recognition method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology