Speech enhancement method based on constraint naive generative adversarial network

A speech enhancement and simple technology, applied in biological neural network models, speech analysis, neural learning methods, etc., can solve the problem of difficulty in estimating the distribution of speech and noise signals, and achieve the effect of improving speech intelligibility and avoiding phase distortion.

Inactive Publication Date: 2020-02-18
NANCHANG HANGKONG UNIVERSITY
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The problem to be solved by the present invention is to provide a speech enhancement method based on constrained naive generative confrontation network, which skillfully solves the problem that the distribution of speech and noise signals is difficult to estimate, helps to improve speech intelligibility, and avoids phase distortion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method based on constraint naive generative adversarial network
  • Speech enhancement method based on constraint naive generative adversarial network
  • Speech enhancement method based on constraint naive generative adversarial network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The implementation of the present invention will be described in detail below with reference to the drawings and examples, so as to fully understand and implement the implementation process of how to use technical means to solve technical problems and achieve technical effects in the present invention.

[0030] The present invention adopts such as figure 1 The flow chart of the speech enhancement method based on the constrained naive generative confrontation network (CN-GAN) shown in the figure realizes speech denoising in a low signal-to-noise ratio environment. The specific implementation steps are as follows:

[0031] 1) Noise data collection and labeling

[0032] (1.1) data collection: the present invention example adopts the sp01~sp30 speech of NOIZEUS storehouse as pure speech, adopts the babble noise in the NOISEX~92 noise storehouse, white noise, hfchannel noise and buccaneer1 noise are as noise signal, and sampling frequency is 8KHz;

[0033] (1.2) Data labeli...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech enhancement method based on a constraint naive generative adversarial network. The method comprises the following steps of 1) performing noise data collection and marking; 2) performing voice framing and windowing; 3) performing amplitude compression; 4) inputting the constraint naive generative adversarial network for training; 5) performing amplitude decompression; 6) performing inverse short-time Fourier transform to generate an enhanced speech. The method has the advantages that by adversarial learning between a generative model and a discriminant model inthe generative adversarial network, the sample generation capability of the generative model is continuously enhanced and finally distribution of clean speech samples is obtained; no any assumption exists for statistical distribution of speeches or noises; and a complex number spectrum mapping method is adopted, so that phase information is added in training samples. According to the method, the problem that speech and noise signal distribution is difficult to estimate is ingeniously solved, the speech intelligibility is improved, and phase distortion is avoided.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to a speech enhancement method based on a constrained naive generative confrontation network. Background technique [0002] As the main medium of human communication, speech has already played an important role in the fields of mobile communication and multimedia technology. In the context of the ascendant artificial intelligence, the wide application of speech recognition, voiceprint recognition and other technologies has also put forward higher requirements for the quality of speech signals. However, in actual speech acquisition and dialogue communication scenarios, speech signals are often interfered by various noises, mainly including background noise, channel noise and interference noise. Speech enhancement is an effective technique to solve noise pollution. [0003] There are four main traditional speech enhancement methods: (1) spectral subtraction, which uses the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/02G10L21/0208G10L25/30G06N3/08
CPCG06N3/08G10L21/02G10L21/0208G10L25/30
Inventor 袁丛琳孙成立
Owner NANCHANG HANGKONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products