Speech enhancement method, device, apparatus and storage medium

A speech enhancement technology, applied to speech enhancement methods, devices, apparatus and storage media, that addresses problems such as poor speech enhancement performance.

Active Publication Date: 2019-03-01
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD
6 Cites, 14 Cited by

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a voice enhancement method, device, apparatus and storage medium to solve the problem of poor voice enhancement effect in the prior art.

Embodiment Construction

[0087] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative effort fall within the protection scope of the present invention.

[0088] The speech enhancement method provided by the embodiment of the present invention can be applied to any device that needs to perform speech enhancement, that is, a speech enhancement device. The speech enhancement device may be, for example, a smart speaker, a car navigation system, a device equipped with DuerOS, a smart TV, a smart refrigerator, a...

Abstract

The invention provides a speech enhancement method, a speech enhancement device, a speech enhancement apparatus and a storage medium. The method includes the following steps: the speech features of the speech to be enhanced are obtained; the speech features are input into an enhancement model to obtain the ideal ratio mask (IRM) of the speech to be enhanced, wherein the enhancement model is implemented based on a generative adversarial network (GAN) and is used to obtain the IRM from the speech features; and the speech enhancement result is obtained from the speech features of the speech to be enhanced and its IRM. With the speech enhancement method, device, apparatus and storage medium of the invention, the speech enhancement effect can be improved.
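The three steps in the abstract (obtain features, obtain the IRM from an enhancement model, combine features and IRM) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the features are taken to be a small magnitude spectrogram stored as nested lists, the IRM uses the common textbook form (S / (S + N)) ** beta, and the hypothetical `oracle_model` function stands in for the trained GAN-based enhancement model, which the listing does not specify in detail.

```python
def ideal_ratio_mask(clean_power, noise_power, beta=0.5):
    """Textbook IRM per time-frequency bin: (S / (S + N)) ** beta.

    This definition is an assumption; the patent excerpt does not
    state which IRM formula it uses.
    """
    mask = []
    for s_row, n_row in zip(clean_power, noise_power):
        mask.append([
            (s / (s + n)) ** beta if (s + n) > 0 else 0.0
            for s, n in zip(s_row, n_row)
        ])
    return mask


def apply_mask(noisy_features, mask):
    """Element-wise product of the noisy features and the mask."""
    return [
        [x * m for x, m in zip(x_row, m_row)]
        for x_row, m_row in zip(noisy_features, mask)
    ]


def enhance(noisy_features, enhancement_model):
    """Step 2: the model maps features to an IRM. Step 3: apply it."""
    mask = enhancement_model(noisy_features)
    return apply_mask(noisy_features, mask)


# Toy data: 2 frames x 2 frequency bins (hypothetical values).
clean_power = [[4.0, 0.0], [1.0, 9.0]]
noise_power = [[0.0, 1.0], [3.0, 0.0]]
noisy_magnitude = [[2.0, 1.0], [2.0, 3.0]]

# Oracle mask computed from known clean/noise power; a trained model
# would have to estimate this from the noisy features alone.
oracle_mask = ideal_ratio_mask(clean_power, noise_power)


def oracle_model(features):
    """Hypothetical stand-in for the GAN-based enhancement model."""
    return oracle_mask


enhanced = enhance(noisy_magnitude, oracle_model)
print(oracle_mask)  # [[1.0, 0.0], [0.5, 1.0]]
print(enhanced)     # [[2.0, 0.0], [1.0, 3.0]]
```

Bins dominated by speech keep their magnitude (mask near 1), while bins dominated by noise are attenuated (mask near 0). In a real system the mask would come from the trained model rather than from known clean and noise signals.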

Description

Technical field

[0001] The present invention relates to the field of speech, in particular to a speech enhancement method, device, apparatus and storage medium.

Background technique

[0002] Speech enhancement refers to the technology of extracting the useful speech signal from background noise, suppressing and reducing noise interference, when the speech signal is disturbed or even submerged by various kinds of noise.

[0003] In the prior art, speech enhancement based on deep learning is mainly realized through a deep neural network (DNN), a convolutional neural network (CNN) or a recurrent neural network (RNN). DNNs, CNNs and RNNs mainly model noise with known distributions.

[0004] However, since the distribution of speech noise is usually complex and unknown, deep-learning-based speech enhancement implemented with a DNN, CNN or RNN suffers from poor enhancement performance. Contents of...

Application Information

IPC(8): G10L21/0208, G10L25/30
CPC: G10L21/0208, G10L25/30
Inventor 成学军
Owner BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD