Speech enhancement algorithm based on attention mechanism

A speech enhancement and attention technology, applied in speech analysis, biological neural network models, instruments, etc., can solve problems such as limiting model performance, inability to effectively deal with complex noise changes, and model performance impact, and achieve speech noise reduction quality, The effect of good handling mechanisms
CN110299149AInactive Publication Date: 2019-10-01UNIV OF ELECTRONICS SCI & TECH OF CHINA

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
UNIV OF ELECTRONICS SCI & TECH OF CHINA
Publication Date
2019-10-01
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses a speech enhancement algorithm based on an attention mechanism. A neural network speech enhancement model based on the attention mechanism is constructed, and comprises three components of a neural network based on the attention mechanism, a standard deep loop neural network and a time-frequency masking layer. At each time step, the model performs attention mechanism calculation on an incoming frame at current time and speech frames of an entire segmentto obtain feature vector expression corresponding to the current time step. The model input is obtained by splicing thecurrent time step feature vector with the current speech frame, and the current input is encoded by the standard deep loop neural network to obtain a predicted value of time-frequency masking. The predicted value of the time-frequency masking is multiplied by the mixed speech step-by-step to obtain an enhanced speech segment. The algorithm models a speech enhancement problem from the perspectiveof improving the generalization performance of the model, and can effectively solve the speech enhancement problem in a noise scene which does not appear in the training.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the field of speech signal processing, in particular to a speech enhancement algorithm based on an attention mechanism. Background technique

[0002] Speech enhancement is a fundamental problem in the field of speech processing. At present, speech-based human-computer interaction is booming. Under laboratory conditions, algorithms such as speech recognition and speaker recognition already have a high accuracy rate, but in the application of actual scenarios, the existence of noise makes the accuracy of these speech applications Therefore, reducing the interference of noise to speech signals is an urgent problem to be solved. At present, the speech enhancement algorithm based on deep learning has received a lot of attention, produced a lot of valuable work, and attracted the interest of a large number of researchers.

[0003] The speech enhancement algorithm based on deep learning is a data-driven method, and the performance o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More