A voice processing method, device and electronic equipment

A voice processing and voice enhancement technology, which is applied in the field of data processing, can solve the problems of poor voice enhancement effect and lower voice enhancement efficiency, and achieve the effect of ensuring efficiency and good voice enhancement effect

Active Publication Date: 2022-06-07
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, if the convolution kernel of the neural network is not large enough and the number of convolution layers is not large enough, the speech enhancement effect will be poor; but if in order to improve the speech enhancement effect, the convolution kernel in the neural network and the number of convolution layers will be increased. Reduced efficiency of speech enhancement

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A voice processing method, device and electronic equipment
  • A voice processing method, device and electronic equipment
  • A voice processing method, device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the above objects, features and advantages of the present invention more clearly understood, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments.

[0046] One of the core concepts of the embodiments of the present invention is to introduce a self-attention mechanism in the speech enhancement process, so that information at any global position can be considered, and attention can be focused on more important content; thus, the speech enhancement effect is improved. while ensuring the efficiency of speech enhancement.

[0047] refer to figure 1 , showing a flow chart of the steps of an embodiment of a speech processing method of the present invention, which may specifically include the following steps:

[0048] Step 102: Acquire the voice data to be processed.

[0049] In the embodiment of the present invention, when a certain piece of voice data needs to be played, or a ce...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the present invention provide a voice processing method, device, and electronic equipment, wherein the method includes: acquiring voice data to be processed; performing voice enhancement on the voice data to be processed by using a target voice enhancement model, the target voice The enhancement model is integrated by the initial speech enhancement model and the self-attention mechanism; since the self-attention mechanism can consider the information of any global position and focus on more important content, there is no need to increase the initial speech enhancement model The convolutional layer and the increased convolution kernel can achieve a better speech enhancement effect and ensure the efficiency of speech enhancement.

Description

technical field [0001] The present invention relates to the technical field of data processing, and in particular, to a voice processing method, device and electronic device. Background technique [0002] With the rapid development of communication technology, terminals such as mobile phones and tablet computers are becoming more and more popular, which brings great convenience to people's life, study and work. The user usually uses the terminal to input voice commands for voice photography, voice search, etc.; and also uses the terminal to play voice data (such as music, video, and recording). Among them, in order to enable the terminal to better execute the user's voice commands and play voice data with higher quality, the terminal can use voice enhancement technology to suppress and reduce noise interference in voice data after collecting voice commands or before playing voice data. Extract useful speech data from noisy backgrounds. [0003] At present, neural networks ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L21/02G10L25/03G10L25/30
CPCG10L21/02G10L25/03G10L25/30
Inventor 文仕学郝翔潘逸倩
Owner BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products