Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice processing method, voice processing device and device for processing voice

A speech processing and noisy speech technology, applied in the computer field, can solve the problems of poor speech noise reduction, noisy speech correction, and lack of consideration of the phase information of speech signals, so as to achieve phase accuracy, reduce speech distortion, and improve speech The effect of the noise reduction effect

Inactive Publication Date: 2020-02-18
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
View PDF3 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the ideal floating value masking does not take into account the phase information of the speech signal, the phase of the noisy speech cannot be corrected during the process of synthesizing the target speech, resulting in the inaccurate phase of the synthesized target speech. Therefore, this speech noise reduction method The lower voice distortion is greater, and the voice noise reduction effect is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice processing method, voice processing device and device for processing voice
  • Voice processing method, voice processing device and device for processing voice
  • Voice processing method, voice processing device and device for processing voice

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0020] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0021] Please refer to figure 1 , which shows a process 100 of an embodiment of the speech processing method according to the present application. The above-mentioned voice processing method can be run on various electronic devices, and the above-mentioned electronic device...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a voice processing method, a voice processing device and a device for processing voice. The embodiment of the method comprises the following steps of performing time-frequency analysis on noisy voice to obtain the frequency spectrum of the noisy voice in complex domains; inputting the frequency spectrum of the noisy voice in the complex fields into a pretrained time-frequency shielding prediction model; obtaining a prediction value of time-frequency masking of the noisy voice in the complex domains; multiplying the prediction value by the frequency spectrum of the noisy voice in the complex domains; generating the frequency spectrum of target voice in the noisy voice in the complex domains; and synthesizing the target voice on the basis of the frequency spectrum of the target voice in the complex domains. The embodiment has the advantages that the voice distortion degree is reduced, and the voice noise reduction effect is improved.

Description

technical field [0001] The embodiments of the present application relate to the field of computer technology, and in particular to a voice processing method, device and device for processing voice. Background technique [0002] With the development of computer technology, voice interaction products such as smart speakers and recording pens are becoming more and more abundant. Since voice interaction products receive noise and reverberation signals while receiving voice signals, in order to avoid affecting the voice recognition effect, it is usually necessary to extract the target voice from voices with noise and reverberation (such as relatively pure voice). [0003] Existing methods usually use the Ideal Ratio Mask (IRM) as the target, train a model for predicting the ideal Ratio Mask, and then use the model to obtain the predicted value of the ideal Ratio Mask for noisy speech, and then based on The predicted value obtains the masked acoustic feature, and then the masked...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0272G10L21/0208G10L21/0232G10L21/0216
CPCG10L21/0208G10L21/0216G10L21/0232G10L21/0272
Inventor 刘允李劲东
Owner BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products