Mongolian large vocabulary continuous speech recognition method

A speech recognition and large vocabulary technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of sparse language model data, long speech recognition time, and the inability of speech recognition system to contain large-scale Mongolian words, etc. The effect of improving the amount of calculation and system performance

Active Publication Date: 2016-09-21
INNER MONGOLIA UNIVERSITY
View PDF9 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In order to achieve the above object, the present invention provides a method for continuous speech recognition of large Mongolian vocabulary, which solves the problem that the speech recognition system in the prior art cannot contain large-scale Mongolian words, and the time of speech recognition is too long caused by the excessive amount of words , the problem of sparse language model data in speech recognition systems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mongolian large vocabulary continuous speech recognition method
  • Mongolian large vocabulary continuous speech recognition method
  • Mongolian large vocabulary continuous speech recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The principle of Mongolian segmentation recognition:

[0034] Mongolian is a typical agglutinative language, and Mongolian words are mainly formed by splicing roots and affixes, such as figure 2 shown. From the combination of roots and affixes, it can be seen that the splicing of roots and word-forming affixes or configurational suffixes has actual semantic modification, while the splicing of subsequent and ending suffixes has only grammatical meaning, and its position is always stored in the composition the end of the word. Ending suffixes do not belong to stem suffixes, which include case suffixes of static words, possessive (owner) suffixes, formal verbs (time, person) suffixes and adverb suffixes. As for the verb suffix, if the verb acts as the predicate of the main clause, it can be considered as an ending suffix, but when the verb is used as a static word (especially when it is followed by a case suffix), it can be considered as a stem suffix. In general, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Mongolian large vocabulary continuous speech recognition method. The method is composed of a preprocessing phase, a preparation phase, a training phase, a decoding phase and a synthesis conversion phase. In the preprocessing phase, text training data is segmented and a pronunciation dictionary is established; in the preparation phase, acoustic features are extracted from input voice signals; in the training phase, an acoustic model is trained by use of a whole-word pronunciation dictionary, and a language model is trained by use of a training text after segmentation; in the decoding phase, by use of the acoustic model, the language model and the pronunciation dictionary, the input acoustic features are recognized into text information; and in the synthesis conversion phase, case suffix errors in a decoding process are corrected by use of rules, stems are merged with case suffixes, and finally, sentences composed of Mongolian words are output. According to the invention, the problems of too long voice recognition time and language model data sparsity in a voice recognition system because the voice recognition system cannot include large-scale Mongolian words and the vocabulary is too large in the prior art are solved.

Description

technical field [0001] The invention belongs to the technical field of speech recognition and relates to a method for continuous speech recognition of Mongolian large vocabulary. Background technique [0002] Speech recognition is a key technology to realize human-machine voice communication. It involves multiple disciplines such as acoustics, linguistics, digital signal processing, and computer science. It is a cutting-edge technology in the field of information processing. The main problem to be solved is how to Received voice messages are converted to text messages. According to different task requirements, speech recognition can be divided into several types: speaker recognition, keyword detection and continuous speech recognition. At present, it has been successfully applied to various fields such as industry, home appliances, communications, automotive electronics, medical care, home services and consumer electronics, and has achieved very good results. [0003] The ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/18
CPCG10L15/02G10L15/18
Inventor 飞龙高光来张红伟
Owner INNER MONGOLIA UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products