Voice processing method and apparatus, terminal device and storage medium

A speech processing and speech technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as unfavorable user speech activities, affecting speech intelligibility, and receiving various kinds of problems.

Active Publication Date: 2019-11-22
广州方硅信息技术有限公司
View PDF5 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, depending on the environment, the smart terminal often receives various background noises while receiving the voice input by the user, which affects the recognizability of the voice input by the user, and is not conducive to the user's various voice activities. Therefore, how to effectively suppress background noise when users perform voice activities needs to be solved urgently.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice processing method and apparatus, terminal device and storage medium
  • Voice processing method and apparatus, terminal device and storage medium
  • Voice processing method and apparatus, terminal device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] In order to enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application.

[0021] Speech noise reduction technology is a technology that separates the speech source signal from the background noise from the noisy audio data mixed with the speech source signal and background noise, thereby eliminating or suppressing the background noise to obtain the speech source signal. The traditional speech noise reduction technology estimates and removes the background noise in the noisy audio data to obtain the speech source signal under the assumption that the background noise signal changes little in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice processing method and apparatus, a terminal device and a storage medium. The method comprises the steps of: obtaining noise-containing audio data, wherein the noise-containing audio data comprise a voice source signal; preprocessing the noise-containing audio data, extracting the noise-containing audio features from the noise-containing audio data, and inputting thenoise-containing audio features into a pre-trained voice processing network model to obtain de-noised audio features, wherein the pre-trained voice processing network model comprises multiple cause and effect convolutional layers and at least one recurrent neural network layer, the multiple cause and effect convolutional layers are configured to output texture features of the corresponding voicesource signal according to the noise-containing audio features, and the at least one recurrent neural network layer is configured to output the de-noised audio features according to the texture features; and obtaining an estimated value of the voice source signal according to the de-noised audio features, and outputting the estimated value as de-noised noise-containing audio data. According to thevoice processing method and apparatus disclosed by the invention, real-time noise reduction of the noise-containing audio data is realized by the cause and effect convolutional layers and the recurrent neural network layer, and the voice noise reduction effect is improved.

Description

technical field [0001] The present application relates to the technical field of audio data processing, and more specifically, to a voice processing method, device, terminal equipment, and storage medium. Background technique [0002] With the rapid development of communication technology and smart terminals, people's entertainment, social activities and other behaviors based on smart terminals are more and more free from geographical and time constraints. Not only can live broadcast, calls, chats, etc. Functional activities can also realize various functions outdoors. However, depending on the environment, the smart terminal often receives various background noises while receiving the voice input by the user, which affects the recognizability of the voice input by the user, and is not conducive to the user's various voice activities. Therefore, how to effectively suppress background noise when the user performs voice activities needs to be solved urgently. Contents of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0208G10L15/06G10L25/30
CPCG10L15/063G10L21/0208G10L25/30
Inventor 黄杰雄戴长军黄健源
Owner 广州方硅信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products