Construction Method of Acoustic Model of Deep Long Short-Term Memory Recurrent Neural Network Based on the Principle of Selective Attention

A recurrent neural network, long-term and short-term memory technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as inability to meet practical performance

Active Publication Date: 2018-01-12
TSINGHUA UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most recognition systems are still very sensitive to changes in the acoustic environment, especially under the interference of cross-talk noise (two or more people talking at the same time) and cannot meet the requirements of practical performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Construction Method of Acoustic Model of Deep Long Short-Term Memory Recurrent Neural Network Based on the Principle of Selective Attention
  • Construction Method of Acoustic Model of Deep Long Short-Term Memory Recurrent Neural Network Based on the Principle of Selective Attention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0021] The invention realizes continuous speech recognition by utilizing a deep long-short-term memory cycle neural network acoustic model based on the principle of selective attention. However, the models and methods provided by the present invention are not limited to continuous speech recognition, and can also be any methods and devices related to speech recognition.

[0022] The present invention mainly comprises the steps:

[0023] The first step is to construct a deep long short-term memory recurrent neural network based on the principle of selective attention

[0024] Such as figure 1 As shown, input 101 and input 102 are voice signal input x at time t and t-1 time t and x t-1 (t∈[1, T], T is the total time length of the voice signal); the long-short-term memory recurrent neural network at t moment is composed of attention gate 103, i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A construction method for a deep long short-term memory recurrent neural network acoustic model based on a selective attention principle. Change of an instant function of neurons of an auditory cortex is represented by adding an attention gate (103) unit in the deep long short-term memory recurrent neural network acoustic model, and the attention gate (103) unit is different from other gate units in that: the other gate units correspond to a time sequence on a one-to-one basis, but the attention gate (103) unit shows a short-term plasticity effect, thereby having intervals on the time sequence; extraction of robust features about Cross-talk noise and construction of a robust acoustic model are realized via the recurrent neural network acoustic model obtained by training a large amount of voice data containing the Cross-talk noise, and the purpose of increasing the robustness about the acoustic model can be achieved by restraining the influence of a non-target stream against the extraction of the features; the method can be extensively applied to the field of a plurality of machine learning related to speaker recognition and keyword recognition in voice recognition, human-machine interaction and the like.

Description

technical field [0001] The invention belongs to the field of audio technology, in particular to a method for constructing an acoustic model of a deep long-term short-term memory cyclic neural network based on the principle of selective attention. Background technique [0002] With the rapid development of information technology, speech recognition technology has the conditions for large-scale commercialization. At present, speech recognition mainly uses continuous speech recognition technology based on statistical models, and its main goal is to find the word sequence with the highest probability represented by a given speech sequence. The task of the continuous speech recognition system based on the statistical model is to find the most probable word sequence represented by the given speech sequence, usually including the construction of acoustic models and language models and their corresponding search and decoding methods. With the rapid development of acoustic models an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/06G10L15/16
CPCG10L15/02G10L15/06
Inventor 杨毅孙甲松
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products