Improved self-coding neural network voice enhancement algorithm

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A neural network and speech enhancement technology, which is applied in speech analysis, instruments, etc., can solve the problems that the non-stationary noise signal filtering effect is not good enough, affects the speech enhancement effect, and has a large range of signal-to-noise ratio changes.

Inactive Publication Date: 2018-04-27

HARBIN UNIV OF SCI & TECH

View PDF4 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] However, both time domain and frequency domain enhancement algorithms have the following disadvantages. First of all, these algorithms will add some unreasonable assumptions for the convenience of pushing to the process, and these unreasonable assumptions will definitely affect Speech enhancement effect. Secondly, these enhancement algorithms can have a good filtering effect on stationary noise signals, but the filtering effect on non-stationary noise signals is not good enough. Moreover, there will still be a small amount of noise remaining in the audio after speech enhancement. "Music noise" will also be added. Finally, there are many types of noise in real life, and the range of signal-to-noise ratio is large.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] The best embodiment of the present invention will be described in detail below in conjunction with the description of the accompanying drawings. Through this description, we will have a deeper understanding of the problems raised in the background technology that the present invention can solve in principle, of course This embodiment is not all the embodiments, and all other embodiments based on the principle of the present invention belong to the protection scope of the present invention.

[0020] see figure 1 , figure 2 and image 3 , the embodiment of the present invention comprises:

[0021] An improved self-encoding neural network speech enhancement algorithm, including:

[0022] First of all, it is the pre-processing work before data preparation and voice signals enter the neural network, which will be described as follows. One of the purposes of the present invention is to make the network have a good generalization effect on noise types and signal-to-noise r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an improved self-coding neural network voice enhancement algorithm and aims at problems in the voice enhancement effect existing in traditional voice enhancement algorithms such as a spectral subtraction algorithm and a Wiener filtering method, e.g., the poor filtering effect for the non-stable noise, residual of music noise after enhancement and the poor generalization effect for noise types and the signal to noise ratio. According to the algorithm, a three-level junction of the self-coding neural network is enhanced to be a five-level junction, the neuron numbers of the correspondingly levels are respectively 256, 128, 64, 128 and 256, 100 types of noise are added to pure audios according to signal to noise ratios of -5dB, 0dB, 5dB, 10dB and 15dB, mass data sets are constructed to train the network, the excellent voice enhancement effect can be realized after the network model is trained, as the training data is big, the excellent generalization effect for thenoise types and the signal to noise ratio is realized.

Description

technical field [0001] The invention relates to the technical field of speech signal processing, in particular to an improved self-encoding neural network speech enhancement algorithm. Background technique [0002] In recent years, speech recognition technology has developed rapidly, and the pursuit of higher recognition rate has also accelerated the rapid development of speech enhancement technology. Looking at the development of speech enhancement for decades, great progress has been made in theory and practice. Time The enhancement algorithms in the frequency domain include parameter and filtering methods, and the more mature algorithms in the frequency domain include spectral subtraction and Wiener filtering. These mature algorithms can indeed achieve good enhancement effects on stationary noise. [0003] However, both time domain and frequency domain enhancement algorithms have the following disadvantages. First of all, these algorithms will add some unreasonable assump...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L21/0216G10L25/30

CPCG10L21/0216G10L25/30

Inventor黄金杰王雅君陆春宇

OwnerHARBIN UNIV OF SCI & TECH

Improved self-coding neural network voice enhancement algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology