Improved self-coding neural network voice enhancement algorithm

A neural network and speech enhancement technology, which is applied in speech analysis, instruments, etc., can solve the problems that the non-stationary noise signal filtering effect is not good enough, affects the speech enhancement effect, and has a large range of signal-to-noise ratio changes.

Inactive Publication Date: 2018-04-27
HARBIN UNIV OF SCI & TECH
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, both time domain and frequency domain enhancement algorithms have the following disadvantages. First of all, these algorithms will add some unreasonable assumptions for the convenience of pushing to the process, and these unreasonable assumptions will definitely affect Speech enhancement effect. Secondly, these enhancement algorithms can have a good filtering effect on stationary noise signals, but the filtering effect on non-stationary noise signals is not good enough. Moreover, there will still be a small amount of noise remaining in the audio after speech enhancement. "Music noise" will also be added. Finally, there are many types of noise in real life, and the range of signal-to-noise ratio is large.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Improved self-coding neural network voice enhancement algorithm
  • Improved self-coding neural network voice enhancement algorithm
  • Improved self-coding neural network voice enhancement algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The best embodiment of the present invention will be described in detail below in conjunction with the description of the accompanying drawings. Through this description, we will have a deeper understanding of the problems raised in the background technology that the present invention can solve in principle, of course This embodiment is not all the embodiments, and all other embodiments based on the principle of the present invention belong to the protection scope of the present invention.

[0020] see figure 1 , figure 2 and image 3 , the embodiment of the present invention comprises:

[0021] An improved self-encoding neural network speech enhancement algorithm, including:

[0022] First of all, it is the pre-processing work before data preparation and voice signals enter the neural network, which will be described as follows. One of the purposes of the present invention is to make the network have a good generalization effect on noise types and signal-to-noise r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an improved self-coding neural network voice enhancement algorithm and aims at problems in the voice enhancement effect existing in traditional voice enhancement algorithms such as a spectral subtraction algorithm and a Wiener filtering method, e.g., the poor filtering effect for the non-stable noise, residual of music noise after enhancement and the poor generalization effect for noise types and the signal to noise ratio. According to the algorithm, a three-level junction of the self-coding neural network is enhanced to be a five-level junction, the neuron numbers of the correspondingly levels are respectively 256, 128, 64, 128 and 256, 100 types of noise are added to pure audios according to signal to noise ratios of -5dB, 0dB, 5dB, 10dB and 15dB, mass data sets are constructed to train the network, the excellent voice enhancement effect can be realized after the network model is trained, as the training data is big, the excellent generalization effect for thenoise types and the signal to noise ratio is realized.

Description

technical field [0001] The invention relates to the technical field of speech signal processing, in particular to an improved self-encoding neural network speech enhancement algorithm. Background technique [0002] In recent years, speech recognition technology has developed rapidly, and the pursuit of higher recognition rate has also accelerated the rapid development of speech enhancement technology. Looking at the development of speech enhancement for decades, great progress has been made in theory and practice. Time The enhancement algorithms in the frequency domain include parameter and filtering methods, and the more mature algorithms in the frequency domain include spectral subtraction and Wiener filtering. These mature algorithms can indeed achieve good enhancement effects on stationary noise. [0003] However, both time domain and frequency domain enhancement algorithms have the following disadvantages. First of all, these algorithms will add some unreasonable assump...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0216G10L25/30
CPCG10L21/0216G10L25/30
Inventor 黄金杰王雅君陆春宇
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products