Speech enhancement method for generating adversarial network based on two-dimensional spectrogram and conditions

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A conditional generation and speech enhancement technology, applied in speech analysis, instruments, etc., can solve problems such as uncontrollability, and achieve the effect of strong robustness, good generalization performance, and improved score

Active Publication Date: 2020-01-21

SOUTHEAST UNIV

View PDF8 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Speech enhancement based on semi-supervised GAN (Generative Adversarial Nets) achieves end-to-end speech enhancement and improves the generalization performance of the algorithm. However, the GAN network belongs to semi-supervised learning and does not specify the corresponding label, resulting in generation Network G is relatively free and uncontrollable when processing large data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0052] The technical solutions provided by the present invention will be described in detail below in conjunction with specific examples. It should be understood that the following specific embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention.

[0053] Such as figure 1 As shown, the speech enhancement method based on two-dimensional spectrogram and conditional generative confrontation network provided by this embodiment includes the following steps:

[0054] Step 1, adding different types of noises with different signal-to-noise ratios in the training and testing speech signals to obtain noisy training and testing speech signals, the calculation formula is:

[0055] d(n)=s(n)+v(n)

[0056] Among them, d(n) represents the speech signal after adding noise, s(n) represents the monophonic speech signal, v(n) represents a certain type of noise signal under the specified signal-to-noise ratio, and n represent...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech enhancement method for generating an adversarial network based on a two-dimensional spectrogram and conditions. The method comprises the following steps of forming thetwo-dimensional spectrogram through a plurality of frame spectrums obtained by performing short-time Fourier transform on a speech signal, generating input characteristics of the adversarial networkby serving the two-dimensional spectrogram as the condition, and generating a network G through the mutual adversarial training of a generating network G and a discrimination network D. In the testingprocess, the two-dimensional spectrogram of the noisy speech is extracted, and the G network obtained in the training stage directly maps the noisy speech spectrogram into an enhanced spectrogram, thereby realizing speech enhancement. Through the speech enhancement algorithm for generating the adversarial network based on the spectrogram and the conditions disclosed by the invention, the perception quality of the enhanced speech is greatly improved, and the algorithm has good generalization performance and stronger robustness.

Description

technical field [0001] The invention relates to a speech enhancement method based on a two-dimensional spectrogram and a conditional generation confrontation network, and belongs to the technical field of speech enhancement. Background technique [0002] Speech enhancement refers to the technology that the speech signal is interfered or suppressed by noise, and the effective signal is extracted from the background noise. Its purpose is to eliminate the influence of noise and interference as much as possible, improve the signal-to-noise ratio and speech intelligibility, and improve speech quality. Speech enhancement technology can improve the overall performance of the speech signal processing system. [0003] Currently, there are many kinds of speech enhancement algorithms, which can be classified according to different classification standards. According to the number of sensors or microphones, it can be divided into single-channel (single-microphone) speech enhancement an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L21/0208G10L25/27

CPCG10L21/0208G10L25/27

Inventor周琳钟秋月陆思源李楠

OwnerSOUTHEAST UNIV

Speech enhancement method for generating adversarial network based on two-dimensional spectrogram and conditions

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology