Speech enhancement method, device, equipment and medium based on a convolutional neural network

A convolutional neural network and speech enhancement technology, applied in the field of speech enhancement based on convolutional neural networks, that addresses the low computational efficiency and accuracy of speech enhancement models and achieves the effects of refining image features, enhancing speech, and improving accuracy.
CN113345463A · Pending · Publication Date: 2021-09-03 · PING AN TECH (SHENZHEN) CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
PING AN TECH (SHENZHEN) CO LTD
Publication Date
2021-09-03

Abstract

The invention relates to the technical field of artificial intelligence, and in particular to a speech enhancement method, device, equipment and medium based on a convolutional neural network. The method comprises the following steps: acquiring a time-domain waveform of the speech to be denoised and a speech enhancement model, wherein the model comprises a Gabor convolution layer, a simple recurrent layer, a feature masking layer and a deconvolution layer connected in sequence; performing a Gabor transform on the time-domain waveform through a complex filter to extract Gabor transform features; inputting the Gabor transform features into the simple recurrent layer for prediction, so as to determine the masking vector for the feature masking layer; filtering the Gabor transform features according to the masking vector in the feature masking layer to obtain denoised Gabor transform features; and restoring the denoised Gabor transform features through the deconvolution layer to obtain the target denoised speech. The method can effectively improve the computational efficiency and accuracy of the model.
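The four stages named in the abstract (Gabor convolution, simple recursion (recurrent) layer, feature masking, deconvolution) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names, the number of filters, kernel width, hop size and centre frequencies, and the SRU-style gated recurrence used for the recurrent layer. The weights are random and untrained, so the output only demonstrates the data flow and shapes, not actual denoising.

```python
import numpy as np

def gabor_filters(n_filters=8, width=65, sr=16000):
    """Complex Gabor kernels: Gaussian envelope times a complex sinusoid."""
    t = np.arange(width) - width // 2                      # sample offsets around the centre
    freqs = np.linspace(100.0, sr / 2 - 100.0, n_filters)  # assumed centre frequencies (Hz)
    sigma = width / 6.0                                    # assumed envelope width (samples)
    env = np.exp(-0.5 * (t / sigma) ** 2)
    kernels = env[None, :] * np.exp(2j * np.pi * freqs[:, None] * t[None, :] / sr)
    # Normalise each kernel to unit L2 norm.
    return kernels / np.linalg.norm(kernels, axis=1, keepdims=True)

def gabor_conv(wave, kernels, hop):
    """Strided complex filtering of the waveform -> (frames, n_filters)."""
    width = kernels.shape[1]
    n_frames = 1 + (len(wave) - width) // hop
    frames = np.stack([wave[i * hop:i * hop + width] for i in range(n_frames)])
    return frames @ kernels.conj().T                       # inner product with each kernel

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def simple_recurrent_mask(feats, rng):
    """Hypothetical SRU-style recurrence over frames; outputs a mask in (0, 1)."""
    x = np.abs(feats)                                      # real-valued input: magnitudes
    n = x.shape[1]
    # Random, untrained weights (assumption: a real model would learn these).
    W, Wf, Wo = (rng.standard_normal((n, n)) * 0.1 for _ in range(3))
    c = np.zeros(n)
    mask = np.empty_like(x)
    for t in range(x.shape[0]):
        f = sigmoid(x[t] @ Wf)                             # forget gate
        c = f * c + (1.0 - f) * (x[t] @ W)                 # gated update of the state
        mask[t] = sigmoid(c @ Wo)                          # masking vector for frame t
    return mask

def deconv(feats, kernels, hop, length):
    """Transposed convolution: overlap-add each frame's weighted kernels."""
    width = kernels.shape[1]
    out = np.zeros(length)
    for i, frame in enumerate((feats @ kernels).real):
        out[i * hop:i * hop + width] += frame
    return out

def enhance(wave, hop=16, seed=0):
    rng = np.random.default_rng(seed)
    kernels = gabor_filters()
    feats = gabor_conv(wave, kernels, hop)                 # Gabor convolution layer
    mask = simple_recurrent_mask(feats, rng)               # simple recurrent layer
    masked = mask * feats                                  # feature masking layer
    return deconv(masked, kernels, hop, len(wave)), mask   # deconvolution layer

# Synthetic noisy input: a 440 Hz tone plus white noise at 16 kHz.
n = np.arange(4000)
noisy = np.sin(2 * np.pi * 440 * n / 16000) + 0.1 * np.random.default_rng(1).standard_normal(4000)
denoised, mask = enhance(noisy)
```

The mask is applied elementwise to the complex features, so it scales each filter's response per frame without altering its phase; this is one common design choice for masking-based enhancement and is assumed here rather than stated in the source.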

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, and in particular to a speech enhancement method, device, equipment and medium based on a convolutional neural network.

Background

[0002] Speech enhancement refers to technology that improves the quality and clarity of the useful speech signal and suppresses noise interference when the speech signal is disturbed, or even submerged, by various noises. Because its design process is simple, the end-to-end neural network model is widely used in speech enhancement, but most current research does not effectively consider the local and sequential characteristics of speech, which leaves the computational efficiency and accuracy of current speech enhancement models low.

Summary of the invention

[0003] Embodiments of the present invention provide a speech enhancement method, device, equipment and medium based on a convolutional neural network, so...

Claims
