Speech enhancement algorithm based on speech existence probability and auditory masking effect

An auditory masking effect and speech enhancement technology, applied in speech analysis, instruments, etc., can solve the problems of speech distortion, large noise estimation deviation, and inability to perceive, etc., to achieve the effect of eliminating noise and ensuring the quality of perception

Pending Publication Date: 2021-07-23
NANJING UNIV OF SCI & TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to overcome the problem of speech distortion after spectral subtraction due to the large noise estimation deviation in the existing spectral subtraction method, and propose a speech enhancement algorithm based

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement algorithm based on speech existence probability and auditory masking effect
  • Speech enhancement algorithm based on speech existence probability and auditory masking effect
  • Speech enhancement algorithm based on speech existence probability and auditory masking effect

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0073] The specific implementation method of the present invention will be further described in detail below in conjunction with the accompanying drawings. The following embodiments or drawings are used to illustrate the present invention, but are not used to limit the scope of the present invention, and the described illustrative embodiments are only to illustrate the various steps of the present invention.

[0074] The invention designs a new speech enhancement algorithm by using the auditory masking characteristic of human ears and an improved noise power spectrum estimation method. In terms of auditory characteristics, the presence of speech signals increases the hearing threshold of noise, and the greater the energy of the speech signal, the higher the masking threshold of noise and the harder it is to detect. According to this feature, the present invention does not completely suppress the noise in the spectral subtraction, but makes the residual noise intensity below th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech enhancement algorithm based on a speech existence probability and an auditory masking effect, and the algorithm comprises the steps: carrying out the preprocessing of an input time-domain speech signal, obtaining a frequency-domain speech signal, and keeping a phase angle; carrying out voice existence probability calculation on the obtained frequency domain signal, and obtaining an estimated noise power spectrum; performing noise masking threshold calculation on the obtained frequency domain signal to obtain a spectral subtraction coefficient value of each frequency point; and performing spectral subtraction in combination with the estimated noise power spectrum and the spectral subtraction coefficient to obtain a pure voice spectrum, and then performing inverse Fourier transform in combination with the reserved phase angle to obtain a pure time domain voice signal. According to the invention, the hearing masking effect of the human ear is utilized, the masking threshold value of the noise signal entering the human ear is calculated, noise estimation is combined, the noise can be eliminated, meanwhile, the perception quality of the voice can be guaranteed as much as possible, and more suddenly-changed peak values are not prone to occurring in the voice signal.

Description

technical field [0001] The invention relates to speech signal enhancement technology, in particular to a speech enhancement algorithm based on speech existence probability and auditory masking effect. Background technique [0002] With the development of technologies such as speech recognition, the field of speech enhancement in its front-end preprocessing has become more and more important. At present, speech enhancement algorithms mainly include spectral subtraction, wavelet transform, Wiener filter and so on. Spectral subtraction can better suppress the noise when the signal-to-noise ratio of the input signal is high, but when the signal-to-noise ratio is low, there are more noise residues. The spectral subtraction method is simple and low in complexity, but the noise estimation deviation is relatively large, and half-wave rectification is used for the negative value obtained after spectral subtraction, which leads to the appearance of "music noise" and seriously affects...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/0216G10L21/0224G10L21/0232G10L25/18G10L25/21G10L25/27G10L25/84
CPCG10L21/0216G10L21/0232G10L21/0224G10L25/18G10L25/21G10L25/27G10L25/84
Inventor 程伊鑫樊卫华
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products