Generative adversarial network speech enhancement method based on sparse continuous constraint

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A network voice and sparse technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as non-stationary noise interference and difficult estimation of real voice distribution, and achieve the effect of improving voice intelligibility

Pending Publication Date: 2021-07-02

NANCHANG HANGKONG UNIVERSITY

View PDF7 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The problem to be solved by the present invention is to provide a generative adversarial network speech enhancement method based on sparse continuous constraints to solve the problem of "music noise", non-stationary noise interference, and real speech distribution in the existing methods in the environment of low signal-to-noise ratio. inestimable problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] The implementation of the present invention will be described in detail below with reference to the drawings and examples, so as to fully understand and implement the implementation process of how to use technical means to solve technical problems and achieve technical effects in the present invention.

[0031] A generative adversarial network speech enhancement method based on sparse continuous constraints, to achieve speech denoising in low signal-to-noise ratio environments, such as figure 1 As shown, its specific implementation steps are as follows:

[0032] 1) Data collection and classification

[0033] (1.1) data collection: the present invention example adopts the sp01~sp30 speech of NOIZEUS storehouse as pure speech, adopts the babble noise in the NOISEX~92 noise storehouse, white noise, hfchannel noise and buccaneer1 noise are as noise signal, and sampling frequency is 8KHz;

[0034] (1.2) Data classification: The four kinds of noises described in (1.1) were s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a generative adversarial network speech enhancement method based on sparse continuous constraint. The method comprises the following steps of: 1) collecting and classifying data; 2) carrying out speech framing and windowing; 3) carrying out amplitude compression; 4) inputting sparsity constraint-based generative adversarial network training; 5) carrying out amplitude decompression; and 6) synthesizing an enhanced speech. The method has the advantages that distribution of clean voice samples is finally obtained through adversarial learning between the generative model and the discrimination model in the generative adversarial network; there is no assumption for statistical distribution of voice or noise; and sparsity and continuity constraints are added to a loss function of the generator, so that the obtained sparse matrix can better conform to speech spectrum distribution. According to the method, the problem that the voice and noise signal distribution is difficult to estimate is ingeniously solved, the voice intelligibility is improved, and the enhanced speech more conforming to the pure voice spectrum distribution is obtained.

Description

technical field [0001] The invention relates to the technical field of voice processing, in particular to a method for enhancing voice over a generative confrontation network based on sparse continuous constraints. Background technique [0002] As the main medium of human communication, voice has already played an important role in mobile communication, voice assistant and other fields. Against the background of artificial intelligence technology and Internet of Things technology in the ascendant, the wide application of speech recognition, voiceprint recognition and other technologies also put forward higher requirements for the quality of speech signals. However, in actual speech acquisition and dialogue communication scenarios, speech signals are often interfered by various noises. Speech enhancement is an effective technique to solve noise pollution. [0003] There are four main traditional speech enhancement methods: (1) spectral subtraction, which uses the short-term...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/06G10L15/08G10L15/26G10L21/0224G10L21/0232G10L21/045G10L25/45

CPCG10L15/063G10L15/083G10L15/26G10L21/0224G10L21/045G10L21/0232G10L25/45

Inventor孙成立袁丛琳邹强洪依

OwnerNANCHANG HANGKONG UNIVERSITY

Generative adversarial network speech enhancement method based on sparse continuous constraint

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology