Voice enhancement method based on DNN-CLSTM network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A voice enhancement and network technology, applied in voice analysis, instruments, etc., can solve problems such as signal intelligibility, poor voice quality, and voice signal distortion

Pending Publication Date: 2021-04-30

XIAN UNIV OF POSTS & TELECOMM

View PDF0 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The purpose of the present invention is to solve problems such as speech signal distortion, signal i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Login to View More

PUM

Login to View More

Abstract

The invention relates to a speech enhancement method based on a deep neural network and a residual long-short term memory (DNN-NCLSTM) network. According to the method, voice amplitude characteristics obtained through spectral subtraction and voice Mel-frequency cepstrum coefficient (MFCC) characteristics obtained through fast Fourier transform are input into a DNN-CLSTM network model, and the purpose of voice enhancement is achieved. The method comprises the following steps: firstly, carrying out time-frequency masking and windowing framing processing on noisy speech, obtaining the amplitude and phase characteristics of the noisy speech by utilizing fast Fourier transform, and estimating the noise amplitude of the noisy speech; secondly, subtracting the estimated noise signal amplitude from the noise-containing voice amplitude to obtain a voice signal amplitude after spectral subtraction, and taking the voice signal amplitude as a first feature of neural network input; then, performing fast Fourier transform (FFT) on the noise-containing voice, and solving spectral line energy of the voice signal to obtain an MFCC feature of the noise-containing voice as a second feature of the voice signal; inputting the two features into the DNN-CLSTM network for training to obtain a network model, and evaluating the effectiveness of the model by adopting a minimum mean square error (MMSE) loss function evaluation index; and finally, inputting the actual noise-containing voice set into a trained voice enhancement network model, predicting an estimated amplitude and MFCC after enhancement, and obtaining a final enhanced voice signal by adopting inverse Fourier transform. The method has high fidelity of voice.

Description

technical field [0001] The invention belongs to the technical field of speech enhancement, and in particular relates to a speech enhancement method based on a DNN-CLSTM network. Background technique [0002] Voice, as one of the main ways of information transmission, has been widely used in life. With the development of technology, voice not only plays the role of information transmission between people, but also widely used in human-computer interaction. voice signal. However, in our communication process, speech signals are often accompanied by a large number of noise signals, such as background noise such as factory noise, car noise, or restaurant noise. A speech signal containing a lot of noise will cause a lot of interference when the receiver extracts useful information contained in the speech signal. In response to this problem, speech signal enhancement technology has received extensive attention. [0003] Speech enhancement refers to the process of separating the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/0208G10L21/0216G10L25/24G10L25/30

CPCG10L21/0208G10L21/0216G10L25/24G10L25/30

Inventor 汪友明张天琦

Owner XIAN UNIV OF POSTS & TELECOMM

Voice enhancement method based on DNN-CLSTM network

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Method used

Image

Examples

PUM

Abstract

Description

Claims

Application Information

Agents

Company

Voice enhancement method based on DNN-CLSTM network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

PUM

Abstract

Description

Claims

Application Information

Agents

Company

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology