Unlock instant, AI-driven research and patent intelligence for your innovation.

Method for constructing acoustic model, speech recognition system and speech recognition method

An acoustic model and phoneme technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as gradient disappearance, increase model depth, and improve recognition effect, so as to reduce costs and ensure performance

Active Publication Date: 2022-04-15
AISPEECH CO LTD
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in practice, it is found that this method of only adding jump links between single layers will still cause the problem of gradient disappearance when training a deep DFSMN with a very large number of layers, resulting in the model not being able to be trained robustly, resulting in Increasing the depth of the model cannot improve the recognition effect, and may even lead to a decrease in the recognition accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for constructing acoustic model, speech recognition system and speech recognition method
  • Method for constructing acoustic model, speech recognition system and speech recognition method
  • Method for constructing acoustic model, speech recognition system and speech recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0063] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other.

[0064] The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, progr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for constructing an acoustic model, comprising: pre-training a gated residual DFSMN neural network module; sequentially connecting a plurality of said gated residual DFSMN neural network modules; The control residual DFSMN neural network module is configured with an input layer and an output layer to form the acoustic module. In the embodiment of the present invention, the gated residual DFSMN neural network module is first pre-trained, and then a plurality of sequential connections are formed to form a deep gated residual DFSMN network as an acoustic model. Since the pre-trained gated residual DFSMN neural network module itself has multiple layers of DFSMN, the obtained acoustic model with multiple gated residual DFSMN neural network modules can contain hundreds of DFSMN layers. In actual training, only a relatively small gated residual DFSMN neural network module needs to be trained, and finally a deep acoustic model is obtained by stacking. Therefore, it not only reduces the cost of training and learning, but also ensures the performance of the final acoustic model.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a method for constructing an acoustic model, a speech recognition system and a speech recognition method. Background technique [0002] Speech recognition, in layman's terms, is to convert a piece of speech signal into corresponding text information. Specifically, speech recognition is to sample from a continuous sound wave, quantize each sample value; then divide the quantized sampled audio into frames, and extract a feature vector describing the content of the spectrum for each frame; finally, according to the speech signal The features recognize the words represented by the speech. [0003] The entire process of speech recognition mainly includes feature extraction and decoding (acoustic model, dictionary, language model) parts. [0004] Feature extraction: extract the time-varying speech feature sequence from the speech waveform (that is, convert the sound signal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/16G10L15/02
Inventor 薛少飞
Owner AISPEECH CO LTD