Speech enhancement processing method

A speech enhancement processing method, applied in speech analysis, instruments, and related fields, that addresses gradient instability in generative adversarial networks and reduces the amount of calculation, achieving stable gradients, reduced interference, and fewer iterations.

Active Publication Date: 2019-03-26
SHANGHAI MARITIME UNIVERSITY
Cites: 6; Cited by: 19
AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a speech enhancement processing method that solves the problem of unstable gradients in generative adversarial networks and converges faster. At the same time, the use of mini-batch computation reduces the amount of calculation.
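
The mini-batch idea can be illustrated with a toy sketch. It assumes a Wasserstein-style critic, in line with the WGAN named in the abstract, but the linear critic, the weight-clipping constant, the learning rate, and the random stand-in features are all hypothetical choices made for illustration and are not taken from the patent.

```python
import numpy as np

def minibatches(samples, batch_size, rng):
    """Yield shuffled mini-batches (rows of `samples`)."""
    order = rng.permutation(len(samples))
    for start in range(0, len(samples), batch_size):
        yield samples[order[start:start + batch_size]]

def critic_step(w, real, fake, lr=0.05, clip=0.01):
    """One Wasserstein critic update for a linear critic D(x) = x @ w.

    The critic maximises mean(D(real)) - mean(D(fake)); for a linear
    critic the gradient with respect to w is mean(real) - mean(fake).
    Weights are then clipped to [-clip, clip] (assumed constraint).
    """
    grad = real.mean(axis=0) - fake.mean(axis=0)
    return np.clip(w + lr * grad, -clip, clip)

rng = np.random.default_rng(0)
# Hypothetical stand-ins: 64 feature vectors of dimension 8 each.
clean = rng.normal(1.0, 0.1, size=(64, 8))     # "clean speech" features
enhanced = rng.normal(0.0, 0.1, size=(64, 8))  # "generator output" features

w = np.zeros(8)
steps = 0
# One parameter update per mini-batch instead of one per full pass.
for real, fake in zip(minibatches(clean, 16, rng),
                      minibatches(enhanced, 16, rng)):
    w = critic_step(w, real, fake)
    steps += 1

# Critic's estimate of the gap between the two feature distributions.
gap = (clean @ w).mean() - (enhanced @ w).mean()
```

Each update touches only 16 of the 64 samples, so the cost per step is a quarter of a full-batch step, while the clipped weights keep the critic's output bounded.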

Examples

Detailed Description of the Embodiments

[0033] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention.

[0034] See Figures 1-4. It should be noted that the diagrams provided in this embodiment only schematically illustrate the basic idea of the present invention. They show only the components related to the present invention and are not drawn according to the number, shape, and dimensions of the components in actual implementation. In actual implementation, the type, quantity and proportion of each component can be changed arbitrarily d...

Abstract

The invention discloses a speech enhancement processing method. The method comprises the following steps: a training sample is formed from speech data and noise data; the training sample is preprocessed to obtain a processed denoising sample; the denoising sample is divided into multiple batches, and a WGAN model is trained on each batch in turn until all batches have been used, yielding a final WGAN-MBGD model; the final WGAN-MBGD model is then used to output an enhanced speech signal. The method has the following advantages: the gradient instability of the generative adversarial network is resolved, convergence is faster, mini-batch computation reduces the amount of calculation, and spectral subtraction factors and spectral lower-limit factors are introduced so that residual noise is reduced by reducing the error between spectra.
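
The abstract does not state how the spectral subtraction factor and the spectral lower-limit factor enter the WGAN training, so the sketch below only shows how these two factors work in classical power spectral subtraction, together with one way of forming a noisy training sample from speech and noise. The function names, the values of `alpha` and `beta`, the sine-wave stand-in for speech, and the oracle noise estimate are assumptions for illustration, not the patented procedure.

```python
import numpy as np

def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio is snr_db, then add."""
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2)
    scale = np.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    return speech + scale * noise

def spectral_subtract(noisy_power, noise_power, alpha=2.0, beta=0.01):
    """Classical power spectral subtraction.

    alpha: spectral subtraction factor (how much noise power is removed)
    beta:  spectral lower-limit factor (floor that prevents negative power)
    """
    diff = noisy_power - alpha * noise_power
    floor = beta * noise_power
    return np.maximum(diff, floor)

rng = np.random.default_rng(0)
t = np.arange(512) / 8000.0
speech = np.sin(2 * np.pi * 440 * t)   # hypothetical stand-in for a speech frame
noise = rng.normal(size=t.size)
noisy = mix_at_snr(speech, noise, snr_db=5.0)

noisy_power = np.abs(np.fft.rfft(noisy)) ** 2
# Oracle noise spectrum, used here only because the true noise is known.
noise_power = np.abs(np.fft.rfft(noisy - speech)) ** 2
enhanced_power = spectral_subtract(noisy_power, noise_power)
```

A larger `alpha` removes more noise at the risk of distorting speech, while `beta` keeps a small noise floor so that isolated spectral peaks (residual "musical" noise) are masked rather than left standing alone.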

Description

Technical Field

[0001] The invention relates to the technical field of speech processing, in particular to a speech enhancement processing method.

Background

[0002] In recent years, with the rapid development of information technology, human-computer interaction systems based on speech recognition have become a mainstream research topic, and more and more speech processing technologies have been applied in major systems. However, these devices usually operate in relatively complex acoustic environments, with sounds such as street whistles, music, birds, and wind. Such background noise often significantly degrades voice quality, so that voice commands cannot be accurately recognized and the system cannot complete the intended function, which greatly degrades the user experience. Therefore, the study of speech enhancement is a topic of practical significance.

[0003] The purpose of speech enhancement is mainly to remove complex ...

Application Information

IPC(8): G10L21/0208
CPC: G10L21/0208, Y02T10/40
Inventors: 张颖, 肖萌萌, 徐志京
Owner: SHANGHAI MARITIME UNIVERSITY