Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks

A technology of recursive neural network and long-short-term memory, which is applied in speech analysis, speech recognition, instruments, etc., can solve the problems of feature transformation and can not reflect the dynamic characteristics of the system well, and achieve the effect of improving performance and enhancing effect

Inactive Publication Date: 2015-09-30
张爱英 +1
View PDF3 Cites 86 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This kind of feature transformation using a fixed time window or span cannot dynamically use the time window or span to perform feature transformation according to the context information, and cannot reflect the dynamic characteristics of the system well.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks
  • Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks
  • Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0062] Such as image 3 As shown, the system mainly includes a bidirectional long-short-term memory recurrent neural network model training part and a feature enhancement part based on the bidirectional long-short-term memory recurrent neural network model. Wherein, the training part of the bidirectional long-short-term memory recurrent neural network model is mainly composed of a second audio input module 201 , a second feature extraction module 202 , a bidirectional long-short-term memory recurrent neural network model training module 203 , and a model storage 204 . The feature enhancement part based on the bidirectional long-short-term memory recurrent neural network model is mainly composed of the first audio input module 101 , the first feature extraction module 102 , the model loading module 103 , the enhancement module 104 , and the enhancement feature storage module 105 . The input of the second audio input module 201 includes noisy audio with a sampling rate of 16000 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a system for enhancing features by the aid of bidirectional long-term and short-term memory recurrent neural networks. The method includes steps of training inputted features with noise and corresponding features without noise by the aid of bidirectional long-term and short-term memory recurrent neural network models; enhancing the features by the aid of the bidirectional long-term and short-term memory recurrent neural network models. The method and the system have the advantages that the long-term and short-term memory recurrent neural network models can be built in two directions by the aid of the inputted features, and accordingly models of contexts of current frames can be effectively built; long-term and short-term memory cells and certain control variables are introduced into the neural networks, and accordingly the models can be built in the dependence on the long contexts; excellent enhancement effects can be realized by the models owing to long-term dependence when the features are enhanced by the aid of the models, and accordingly the performance of voice recognition systems and audio event classification systems can be improved.

Description

technical field [0001] The present invention relates to speech and audio information technology, in particular, the present invention relates to features for enhancing the performance of speech recognition systems, audio event detection and classification systems. Background technique [0002] With the development of computing technology and information technology, especially the proposal of deep machine learning method and its successful application in voice, image, video and other fields, the performance of speech recognition system has been greatly improved and improved, and speech recognition technology has been advanced by leaps and bounds. At the same time, there are some commercial recognition systems and application software, such as Google Voice Search, Bing Voice Search, Siri Voice Assistant, Baidu Voice Assistant, Sogou Voice Input, Xunfei Language Point, etc. Voice as a human-computer interaction tool is gradually changing The way people interact with different d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L21/02G10L25/30
Inventor 张爱英倪崇嘉
Owner 张爱英
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products