Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech processing method, device, terminal equipment and storage medium

A voice processing and voice technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as unfavorable user voice activities, affecting voice intelligibility, receiving various problems, etc.

Active Publication Date: 2020-12-22
广州方硅信息技术有限公司
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, depending on the environment, the smart terminal often receives various background noises while receiving the voice input by the user, which affects the recognizability of the voice input by the user, and is not conducive to the user's various voice activities. Therefore, how to effectively suppress background noise when users perform voice activities needs to be solved urgently.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech processing method, device, terminal equipment and storage medium
  • Speech processing method, device, terminal equipment and storage medium
  • Speech processing method, device, terminal equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] In order to enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application.

[0021] Speech noise reduction technology is a technology that separates the speech source signal from the background noise from the noisy audio data mixed with the speech source signal and background noise, thereby eliminating or suppressing the background noise to obtain the speech source signal. The traditional speech noise reduction technology estimates and removes the background noise in the noisy audio data to obtain the speech source signal under the assumption that the background noise signal changes little in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a voice processing method, device, terminal equipment and storage medium. The method includes: acquiring noisy audio data, which includes a voice source signal; preprocessing the noisy audio data, and obtaining the noisy audio data from Extract noisy audio features from the data and input the pre-trained speech processing network model to obtain denoised audio features. The pre-trained speech processing network model includes multiple causal convolutional layers and at least one recursive neural network layer. A causal convolutional layer is used to output the texture feature of the corresponding speech source signal according to the noise frequency feature, and at least one recursive neural network layer is used to output the denoised audio feature according to the texture feature; according to the denoised audio feature, the speech is obtained An estimate of the source signal and output as denoised noisy audio data. The application realizes real-time noise reduction of noisy audio data through the causal convolution layer and the recursive neural network layer, and improves the voice noise reduction effect.

Description

technical field [0001] The present application relates to the technical field of audio data processing, and more specifically, to a voice processing method, device, terminal equipment, and storage medium. Background technique [0002] With the rapid development of communication technology and smart terminals, people's entertainment, social activities and other behaviors based on smart terminals are more and more free from geographical and time constraints. Not only can live broadcast, calls, chats, etc. Functional activities can also realize various functions outdoors. However, depending on the environment, the smart terminal often receives various background noises while receiving the voice input by the user, which affects the recognizability of the voice input by the user, and is not conducive to the user's various voice activities. Therefore, how to effectively suppress background noise when the user performs voice activities needs to be solved urgently. Contents of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L21/0208G10L15/06G10L25/30
CPCG10L15/063G10L21/0208G10L25/30
Inventor 黄杰雄戴长军黄健源
Owner 广州方硅信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products