Generative adversarial network speech enhancement method based on sparse continuous constraint

A network voice and sparse technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as non-stationary noise interference and difficult estimation of real voice distribution, and achieve the effect of improving voice intelligibility

Pending Publication Date: 2021-07-02
NANCHANG HANGKONG UNIVERSITY
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The problem to be solved by the present invention is to provide a generative adversarial network speech enhancement method based on sparse continuous constraints to solve the problem of "music noise", non-stationary noise interference, and real speech distribution in the existing methods in the environment of low signal-to-noise ratio. inestimable problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generative adversarial network speech enhancement method based on sparse continuous constraint
  • Generative adversarial network speech enhancement method based on sparse continuous constraint
  • Generative adversarial network speech enhancement method based on sparse continuous constraint

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The implementation of the present invention will be described in detail below with reference to the drawings and examples, so as to fully understand and implement the implementation process of how to use technical means to solve technical problems and achieve technical effects in the present invention.

[0031] A generative adversarial network speech enhancement method based on sparse continuous constraints, to achieve speech denoising in low signal-to-noise ratio environments, such as figure 1 As shown, its specific implementation steps are as follows:

[0032] 1) Data collection and classification

[0033] (1.1) data collection: the present invention example adopts the sp01~sp30 speech of NOIZEUS storehouse as pure speech, adopts the babble noise in the NOISEX~92 noise storehouse, white noise, hfchannel noise and buccaneer1 noise are as noise signal, and sampling frequency is 8KHz;

[0034] (1.2) Data classification: The four kinds of noises described in (1.1) were s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a generative adversarial network speech enhancement method based on sparse continuous constraint. The method comprises the following steps of: 1) collecting and classifying data; 2) carrying out speech framing and windowing; 3) carrying out amplitude compression; 4) inputting sparsity constraint-based generative adversarial network training; 5) carrying out amplitude decompression; and 6) synthesizing an enhanced speech. The method has the advantages that distribution of clean voice samples is finally obtained through adversarial learning between the generative model and the discrimination model in the generative adversarial network; there is no assumption for statistical distribution of voice or noise; and sparsity and continuity constraints are added to a loss function of the generator, so that the obtained sparse matrix can better conform to speech spectrum distribution. According to the method, the problem that the voice and noise signal distribution is difficult to estimate is ingeniously solved, the voice intelligibility is improved, and the enhanced speech more conforming to the pure voice spectrum distribution is obtained.

Description

technical field [0001] The invention relates to the technical field of voice processing, in particular to a method for enhancing voice over a generative confrontation network based on sparse continuous constraints. Background technique [0002] As the main medium of human communication, voice has already played an important role in mobile communication, voice assistant and other fields. Against the background of artificial intelligence technology and Internet of Things technology in the ascendant, the wide application of speech recognition, voiceprint recognition and other technologies also put forward higher requirements for the quality of speech signals. However, in actual speech acquisition and dialogue communication scenarios, speech signals are often interfered by various noises. Speech enhancement is an effective technique to solve noise pollution. [0003] There are four main traditional speech enhancement methods: (1) spectral subtraction, which uses the short-term...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/08G10L15/26G10L21/0224G10L21/0232G10L21/045G10L25/45
CPCG10L15/063G10L15/083G10L15/26G10L21/0224G10L21/045G10L21/0232G10L25/45
Inventor 孙成立袁丛琳邹强洪依
Owner NANCHANG HANGKONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products