Voice processing method and device, storage medium and program product

A voice processing and voice technology, which is applied in the fields of equipment, storage media and program products, and voice processing methods, can solve problems such as joint processing, room reverberation interference, and poor accuracy, so as to avoid error transmission and improve accuracy. , the effect of reducing the size

Pending Publication Date: 2022-03-25
DINGTALK (CHINA) INFORMATION TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the process of real-time voice communication, there are not only different types of environmental noise, but also the interference of room reverberation. Therefore, the voice processing model needs to be able to realize the functions of denoising and de-reverberation at the same time. At present, the existing voice processing models are inefficient Lower and less accurate, and denoising and reverberation are treated as two separate problems without considering the problem of joint processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice processing method and device, storage medium and program product
  • Voice processing method and device, storage medium and program product
  • Voice processing method and device, storage medium and program product

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application.

[0056] First, the nouns involved in this application are explained:

[0057] Denoising (Speech Denoising): Also known as noise cancellation, the noise-containing speech received by the microphone is removed through an algorithm module to preserve the fidelity of the original speech signal as much as possible.

[0058] De-reverberation (Speech Dereverberation): The reverberation-containing speech received by the microphone is removed through the algorithm module to achieve the effect that the original speech does not contain reverb...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voice processing method and device, a storage medium and a program product. The method comprises the following steps: determining corresponding feature information of a to-be-processed voice on a plurality of frequency bands; for a sequence formed by the feature information on the plurality of frequency bands, based on a deep learning model used for processing sequence data, obtaining a processing result corresponding to each piece of feature information; and obtaining the processed voice based on the processing result corresponding to each piece of feature information. According to the invention, denoising and dereverberation can be realized at the same time based on the deep learning model, an error transmission phenomenon caused by series connection of different algorithm modules is avoided, the accuracy of the model is improved, in addition, the network model coefficient of each frequency band is shared, the size of the network model and the calculation amount during processing can be effectively reduced, and the processing efficiency is improved.

Description

technical field [0001] The present application relates to the technical field of speech processing, and in particular to a speech processing method, device, storage medium and program product. Background technique [0002] Speech enhancement technology can extract useful speech signals from noisy speech signals and restore the pure original speech as much as possible, which plays a very important role in real-time speech communication. [0003] In the process of real-time voice communication, there are not only different types of environmental noise, but also the interference of room reverberation. Therefore, the voice processing model needs to be able to realize the functions of denoising and de-reverberation at the same time. At present, the existing voice processing models are inefficient Lower and less accurate, and denoising and reverberation are treated separately as two problems without considering the problem of joint processing. Contents of the invention [0004]...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L17/04G10L21/02G10L21/0208
CPCG10L21/02G10L15/02G10L17/04G10L21/0208
Inventor 熊飞飞冯津伟
Owner DINGTALK (CHINA) INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products