Speech noise reduction method based on computational auditory scene analysis and a generative adversarial network model

A speech noise reduction technique built on a network model, applied in speech analysis, instruments, etc. It addresses complex, high-intensity noise that reduces the accuracy of speech recognition and can damage the auditory system of human listeners, and it removes noise while keeping the speech portion free of distortion.

Pending Publication Date: 2018-11-13

THE THIRD RES INST OF CHINA ELECTRONICS TECH GRP CORP

10 Cites, 25 Cited by


Problems solved by technology

Especially in some professional applications, external noise is unavoidable, and in many cases the noise types are complex and the intensity is high.

Such noise seriously affects subsequent speech signal processing, for example by reducing the accuracy of speech recognition.

In addition, if noisy voice data is processed manually, long-term work will damage the listener's auditory system.



Embodiment Construction

[0018] The present invention is described in detail below in conjunction with the implementations shown in the drawings. These implementations do not limit the present invention: any equivalent transformation or substitution of function, method, or structure made by those of ordinary skill in the art on the basis of these implementations falls within the protection scope of the present invention.

[0019] This embodiment provides a speech noise reduction method based on computational auditory scene analysis (CASA) and a generative adversarial network (GAN) model, including:

[0020] Step 1: Process the noisy speech with the generator and discriminator of the generative adversarial network to obtain an intermediate result;

[0021] Step 2: Process the intermediate result with the computational auditory scene analysis method to obtain the final result.
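
The two steps above amount to a two-stage pipeline, which can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patent's implementation: the source does not describe the network architecture or the CASA procedure, so `moving_average_generator` is a hypothetical stand-in for a trained GAN generator, and `frame_energy_mask` is a heavily simplified, time-domain stand-in for the binary time-frequency masking commonly used in CASA. All function and parameter names are invented for this sketch.

```python
from typing import Callable, List, Sequence

Signal = List[float]


def moving_average_generator(noisy: Sequence[float], width: int = 3) -> Signal:
    """Hypothetical stand-in for a trained GAN generator (Step 1).

    A real generator would be a neural network trained against a
    discriminator. A centered moving average only plays its role here so
    that the pipeline runs end to end.
    """
    half = width // 2
    out: Signal = []
    for i in range(len(noisy)):
        window = noisy[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def frame_energy_mask(signal: Sequence[float], frame_len: int = 4,
                      threshold_ratio: float = 0.1) -> Signal:
    """Simplified CASA-style masking stage (Step 2), an assumption.

    CASA systems usually apply a binary mask to time-frequency units.
    This sketch masks time frames only: a frame whose mean energy is
    below threshold_ratio times the loudest frame's energy is zeroed.
    """
    frames = [list(signal[i:i + frame_len])
              for i in range(0, len(signal), frame_len)]
    energies = [sum(x * x for x in frame) / len(frame) for frame in frames]
    peak = max(energies, default=0.0)
    out: Signal = []
    for frame, energy in zip(frames, energies):
        keep = peak > 0.0 and energy >= threshold_ratio * peak
        out.extend(frame if keep else [0.0] * len(frame))
    return out


def denoise(noisy: Sequence[float],
            stage1: Callable[[Sequence[float]], Signal] = moving_average_generator,
            stage2: Callable[[Sequence[float]], Signal] = frame_energy_mask) -> Signal:
    """Run Step 1, then feed its intermediate result to Step 2."""
    intermediate = stage1(noisy)   # Step 1: GAN-based processing (stand-in)
    return stage2(intermediate)    # Step 2: CASA-based processing (stand-in)


# A quiet, noise-only frame followed by a loud, speech-like frame.
example = [0.01, 0.02, 0.01, 0.02, 1.0, 1.2, 1.0, 1.2]
result = denoise(example)
```

With this example input, the quiet leading frame is zeroed and the loud frame is kept in smoothed form. Passing the stages as arguments keeps the Step 1 and Step 2 ordering fixed while letting either stand-in be swapped for a real model.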

[0022] The voice noise reducti...


Abstract

The invention relates to a speech noise reduction method based on computational auditory scene analysis and a generative adversarial network model. The method comprises the following steps: S1, processing the noisy speech based on the generator and discriminator of a generative adversarial network to obtain an intermediate result; and S2, processing the intermediate result based on the computational auditory scene analysis method to obtain a final result. With the method provided by the invention, part of the noise in speech signals obtained in a complex channel background environment can be removed, while the speech portion is kept largely free of distortion.

Description

Technical field

[0001] The invention relates to a speech noise reduction method, in particular to a speech noise reduction method based on computational auditory scene analysis and a generative adversarial network model.

Background art

[0002] Speech is the most important means by which human beings transmit information to each other. A piece of speech carries rich information such as the speaker's intention, identity, and emotion. Voice signals can be transmitted through various media such as air, water, and radio. Speech signals are usually corrupted by various kinds of noise during propagation or because of the limitations of the acquisition equipment. Especially in some professional applications, external noise is unavoidable, and in many cases the noise types are complex and the intensity is relatively high. Such noise seriously affects subsequent speech signal processing, for example by reducing the accuracy of speech recognition. In addition, if the ...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L21/0208, G10L21/0308

CPC: G10L21/0208, G10L21/0308

Inventor: 陈龙, 张小博, 张晓灿

Owner: THE THIRD RES INST OF CHINA ELECTRONICS TECH GRP CORP
