Speech enhancement model training method and device and speech enhancement method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A voice enhancement and voice technology, applied in voice analysis, instruments, etc., can solve problems such as voice signal interference, and achieve the effect of improving performance and improving effect

Pending Publication Date: 2021-12-21

SHANGHAI WINGTECH INFORMATION TECH CO LTD

View PDF0 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] However, with the methods of related technologies, since most of them are modeling noise signals, it is assumed that the background noise environment is approximately stationary relative to the region where the target speech exists, so as to use the noise spectrum of the non-speech segment to estimate the noise of the speech segment In addition, it is assumed that the noise signal and the target speech signal are not correlated with each other, and the relationship is additive in the frequency domain

However, in practical applications, the background noise signal does not satisfy these two assumptions, so the enhanced speech signal usually has background noise interference

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0059] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0060] The training method of the speech enhancement model provided by this application can be applied to such as figure 1 shown in the application environment. The training method of the speech enhancement model is applied in the speech enhancement system. The speech enhancement system includes a terminal device 101 and a sound collection device 102 . Wherein, the terminal device 102 communicates with the sound collection device 102 through a network. By obtaining the speech training set, wherein the speech training set includes noisy speech samples and pure speech sam...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of speech processing, and provides a speech enhancement model training method and device and a speech enhancement method and device. The training method of the speech enhancement model comprises the following steps: acquiring a speech training set, wherein the voice training set comprises noisy voice samples and pure voice samples; acquiring an amplitude spectrum corresponding to the noisy voice sample, inputting the amplitude spectrum into the generation network, and acquiring an enhanced voice amplitude spectrum; acquiring an amplitude spectrum corresponding to the pure voice sample and an enhanced voice amplitude spectrum, and inputting the amplitude spectrum and the enhanced voice amplitude spectrum into a discrimination network to acquire a discrimination result; and adjusting network parameters of the generation network and the discrimination network according to the enhanced voice amplitude spectrum, the amplitude spectrum corresponding to the pure voice sample, the discrimination result and the optimization target, and generating a voice enhancement model. By adopting the method, the performance of the speech enhancement model can be improved, and the speech enhancement effect is further improved.

Description

technical field [0001] The present application relates to the technical field of speech processing, in particular to a speech enhancement model training method and device, and a speech enhancement method and device. Background technique [0002] As one of the mediums of human communication and perception, speech plays an important role in the communication between human beings and the interactive application between human beings and machines. However, in practice, most of the speech signals perceived by users usually contain background noise and interference sound sources. For example, at a noisy dance party, the sounds received by users during communication include not only the target speech of the other speaker, but also Including the background noise of the dance scene and the interfering sound sources of other speakers, it is a typical "cocktail dance" problem. The human ear can clearly judge the content of the target voice of the other speaker by virtue of its unique au...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L21/02G10L21/0208G10L21/0324G10L25/51G10L25/78

CPCG10L21/02G10L25/78G10L25/51G10L21/0208G10L21/0324

Inventor 张雪宋广伟

Owner SHANGHAI WINGTECH INFORMATION TECH CO LTD

Speech enhancement model training method and device and speech enhancement method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology