Acoustic modeling method based on gated recurrent unit

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of recurrent units and modeling methods, applied in neural learning methods, biological neural network models, speech analysis, etc., can solve problems such as limiting speech recognition system performance, insufficient noise data robustness, long training time, etc., to improve recognition Performance, improved convergence speed, and improved robustness

Inactive Publication Date: 2020-11-13

成都启英泰伦科技有限公司

View PDF3 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] However, in practical applications, this type of method is still far from the requirements of large-scale commercialization. The reason is that GRU still has problems such as too many model parameters, too long training time, and not robust enough to noisy data. Greatly limit the performance of speech recognition systems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] Specific embodiments of the present invention will be further described in detail below.

[0024] The acoustic modeling method based on the gated recurrent unit of the present invention can be used in the scene of continuous speech recognition, and can also be used in modeling under other situations related to speech recognition, specifically as figure 1 shown.

[0025] Step 1. Extract the corresponding acoustic features from the original audio data , the subscript t=1, 2, ..., T, where T is the frame number of the speech signal.

[0026] Step 2. Use layer normalization to improve the gated recurrent unit, use the improved gated recurrent unit function to calculate the forward output of the neural network, and normalize it to obtain the output probability of each neuron;

[0027] Regularization can use the softmax function;

[0028] The specific way of normalization is:

[0029] (1.0)

[0030] in, yes Corresponding No. elements, The network output label...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an acoustic modeling method based on a gated recurrent unit. The acoustic modeling method comprises the following steps: step 1, extracting corresponding acoustic features fromoriginal audio data; step 2, improving the gated recurrent unit by utilizing layer normalization, and calculating forward output of a neural network by utilizing the improved gated recurrent unit; step 3, training a model according to the state vector, calculated in the step 2, of the current moment; and step 4, decoding the trained model to find an output sequence with the maximum probability. According to the invention, the layer normalization technology is applied to the gated recurrent neural unit, so that the activation value of neurons can be normalized, the network convergence speed isimproved, and the network training time is reduced; an activation function in a traditional gated recurrent unit is replaced with an ELU activation function, so that the data robustness is improved;meanwhile, by optimizing a computational formula of the gate structure, model parameters of a traditional gated recurrent unit are reduced, and the recognition performance of the model can be improved.

Description

technical field [0001] The invention belongs to the technical field of speech recognition and relates to an acoustic modeling method, in particular to an acoustic modeling method based on a gated recurrent unit. Background technique [0002] In recent years, with the continuous development of artificial intelligence and computer technology, deep learning technology has been widely used in image, voice and other fields. As one of the most natural interaction interfaces between machines and humans, speech has become a research hotspot in academia and industry. [0003] The acoustic model is one of the core modules of the speech recognition system, and its performance directly affects the entire speech system. Before 2009, the basic structure of the acoustic model was a Gaussian Mixture Model-Hidden Markov Model (GMM-HMM). However, with the successful application of neural networks in the field of speech recognition, the traditional GMM- HMM is gradually replaced by DNN-HMM (...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/06G10L15/14G10L15/16G06N3/08

CPCG06N3/084G10L15/063G10L15/144G10L15/148G10L15/16G10L2015/0631

Inventor温登峰何云鹏许兵

Owner成都启英泰伦科技有限公司

Acoustic modeling method based on gated recurrent unit

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology