Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech enhancement model training method and device and speech enhancement method and device

A voice enhancement and voice technology, applied in voice analysis, instruments, etc., can solve problems such as voice signal interference, and achieve the effect of improving performance and improving effect

Pending Publication Date: 2021-12-21
SHANGHAI WINGTECH INFORMATION TECH CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, with the methods of related technologies, since most of them are modeling noise signals, it is assumed that the background noise environment is approximately stationary relative to the region where the target speech exists, so as to use the noise spectrum of the non-speech segment to estimate the noise of the speech segment In addition, it is assumed that the noise signal and the target speech signal are not correlated with each other, and the relationship is additive in the frequency domain
However, in practical applications, the background noise signal does not satisfy these two assumptions, so the enhanced speech signal usually has background noise interference

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement model training method and device and speech enhancement method and device
  • Speech enhancement model training method and device and speech enhancement method and device
  • Speech enhancement model training method and device and speech enhancement method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0060] The training method of the speech enhancement model provided by this application can be applied to such as figure 1 shown in the application environment. The training method of the speech enhancement model is applied in the speech enhancement system. The speech enhancement system includes a terminal device 101 and a sound collection device 102 . Wherein, the terminal device 102 communicates with the sound collection device 102 through a network. By obtaining the speech training set, wherein the speech training set includes noisy speech samples and pure speech sam...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of speech processing, and provides a speech enhancement model training method and device and a speech enhancement method and device. The training method of the speech enhancement model comprises the following steps: acquiring a speech training set, wherein the voice training set comprises noisy voice samples and pure voice samples; acquiring an amplitude spectrum corresponding to the noisy voice sample, inputting the amplitude spectrum into the generation network, and acquiring an enhanced voice amplitude spectrum; acquiring an amplitude spectrum corresponding to the pure voice sample and an enhanced voice amplitude spectrum, and inputting the amplitude spectrum and the enhanced voice amplitude spectrum into a discrimination network to acquire a discrimination result; and adjusting network parameters of the generation network and the discrimination network according to the enhanced voice amplitude spectrum, the amplitude spectrum corresponding to the pure voice sample, the discrimination result and the optimization target, and generating a voice enhancement model. By adopting the method, the performance of the speech enhancement model can be improved, and the speech enhancement effect is further improved.

Description

technical field [0001] The present application relates to the technical field of speech processing, in particular to a speech enhancement model training method and device, and a speech enhancement method and device. Background technique [0002] As one of the mediums of human communication and perception, speech plays an important role in the communication between human beings and the interactive application between human beings and machines. However, in practice, most of the speech signals perceived by users usually contain background noise and interference sound sources. For example, at a noisy dance party, the sounds received by users during communication include not only the target speech of the other speaker, but also Including the background noise of the dance scene and the interfering sound sources of other speakers, it is a typical "cocktail dance" problem. The human ear can clearly judge the content of the target voice of the other speaker by virtue of its unique au...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/02G10L21/0208G10L21/0324G10L25/51G10L25/78
CPCG10L21/02G10L25/78G10L25/51G10L21/0208G10L21/0324
Inventor 张雪宋广伟
Owner SHANGHAI WINGTECH INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products