Unsupervised noise estimation and speech enhancement method based on separable deep automatic encoding technology

A technology of automatic coding and speech enhancement, applied in speech analysis, instruments, etc., can solve problems such as influence effect and mismatch

Active Publication Date: 2015-11-04
PLA UNIV OF SCI & TECH
View PDF5 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the case of unknown noise, or the characteristics of the unknown noise are very d

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unsupervised noise estimation and speech enhancement method based on separable deep automatic encoding technology
  • Unsupervised noise estimation and speech enhancement method based on separable deep automatic encoding technology
  • Unsupervised noise estimation and speech enhancement method based on separable deep automatic encoding technology

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0089] Example

[0090] The application principle of the present invention will be further described below with reference to the drawings and specific embodiments.

[0091] Combine figure 1 The implementation process of the unsupervised noise estimation and speech enhancement method based on the separable deep automatic coding technology of the present invention is as follows.

[0092] S101: Randomly select 500 sentences from different genders and different speakers from the English classic database TIMIT, sample them to 8kHz, shift the frame with a window length of 64ms and 8ms as parameter framing, and then do a 512-point fast Fu The inner leaf transform, after taking the modulus, extract their amplitude spectrum S;

[0093] S102, then perform non-negative matrix decomposition on S to train a non-negative speech dictionary D that can represent speech signals, where the size of the dictionary, that is, the number of basis functions is selected as 2000;

[0094] S103: Next, perform dee...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unsupervised noise estimation and speech enhancement method based on a separable deep automatic encoding technology. The method comprises a step of prior processing and a step of speech enhancement on unknown noise pollution. The method can be flexibly applied to various speech processing scenes. The method is not constrained by a language of a speech content, changes of a speaker, and a kind of noise. Compared with classical stationarity assumption-based spectral estimation algorithm SS (Spectrum Subtraction) and MMSE (Minimum Mean Square Error), the method does not rely on stationarity assumption and can accurately estimate spectrums for stationary or abrupt noise. Compared with algorithm based on a hidden Markov model and a linear prediction coefficient, the method does not need to specify the type of the processed non-stationary noise. Compared with a noise estimation method based on a low rank structure, noise in the method does not need to have a low-rank repeated structure.

Description

technical field [0001] The invention belongs to the technical field of speech signal processing, in particular to an unsupervised noise estimation and speech enhancement method based on separable deep automatic coding technology. Background technique [0002] Speech enhancement is of great significance both for improving the auditory effect of the speech signal and as a front-end processing to improve the performance of the speech recognizer. The core issue of speech enhancement is the separation of speech noise. The ideal speech enhancement technology needs to be able to achieve good results under the premise of unknown noise. For this reason, a key problem that needs to be solved in speech enhancement is the problem of noise estimation. In order to estimate the noise spectrum, some classical algorithms have been proposed, such as Spectrum Subtraction (SS), Minimum Mean Square Error (MMSE), etc., and have been widely used in voice communication. However, these methods are...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/0208G10L19/008
Inventor 孙蒙李轶南张雄伟王艺敏邹霞贾冲李莉
Owner PLA UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products