Voice enhancement network model and single-channel speech enhancement method and system

A speech enhancement network model technology, applied in speech analysis, instruments, etc., that addresses problems such as limited enhancement performance, inconsistent speech spectra, and mismatch between phase and amplitude.

Pending Publication Date: 2021-03-16
BEIJING TSINGMICRO INTELLIGENT TECH CO LTD

AI Technical Summary

Problems solved by technology

Traditional single-channel speech enhancement algorithms (spectral subtraction, statistical-model-based algorithms, and subspace algorithms) usually need to make certain assumptions about the characteristics of the speech signal and the noise, and about whether the two are correlated, which limits their enhancement performance.
Most current deep-learning-based speech enhancement algorithms use frequency-domain features, such as the short-time Fourier transform magnitude spectrum or the logarithmic power spectrum, and replace the phase of the enhanced speech with the phase of the noisy speech. This leaves a mismatch between phase and amplitude in the enhanced speech, resulting in the "inconsistent spectrum" problem.
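The mismatch described above can be written out in a few lines. This is only an illustrative formulation: the symbols $Y$, $\hat{S}$, $\theta_Y$ and the frame and bin indices $t, f$ are notation assumed here, not taken from the patent text.

```latex
% Noisy speech in the STFT domain (magnitude and phase):
Y(t,f) = |Y(t,f)|\, e^{j\theta_Y(t,f)}

% A magnitude-only enhancer estimates |\hat{S}(t,f)| and reuses the noisy phase:
\hat{S}(t,f) = |\hat{S}(t,f)|\, e^{j\theta_Y(t,f)}

% The combined spectrum is in general not the STFT of any time-domain signal:
\mathrm{STFT}\!\left(\mathrm{iSTFT}\big(\hat{S}\big)\right) \neq \hat{S}
```

Because overlapping frames constrain one another, an arbitrary pairing of an estimated magnitude with a borrowed phase usually violates those constraints, which is what the term "inconsistent spectrum" refers to.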



Examples


Embodiment Construction

[0026] For a clearer understanding of the technical features, purposes and effects of the invention, specific embodiments of the present invention are now described with reference to the accompanying drawings, in which the same reference numerals denote components that have the same structure, or a similar structure and the same function.

[0027] Herein, "schematic" means "serving as an example, instance or illustration", and any illustration or implementation described as "schematic" should not be interpreted as a more preferred or more advantageous technical solution. To keep the drawings concise, they only schematically show the parts related to this exemplary embodiment and do not represent the actual structure or true proportions of the product.

[0028] One aspect of the present invention provides a single-channel speech enhancement method, which is implemented through a speech enhancement network model. As shown in Figure 1, t...



Abstract

The invention provides a single-channel speech enhancement method. The method is realized through a speech enhancement network model comprising an analysis layer, an encoder, a time convolution module, a decoder and a synthesis layer. The method adds an analysis layer that resembles a windowed short-time Fourier transform and is built from convolution layers, together with a synthesis layer that resembles the inverse windowed short-time Fourier transform, so that the characteristics of speech can be better extracted in the transform domain. In addition, the encoder and the decoder are constructed from gated convolution layers, which expands the receptive field and gives better control over how information is passed through the hierarchical structure. The time convolution module is placed between the encoder and the decoder to better learn the long-term dependencies of speech, which improves the speech enhancement effect. The invention also provides a single-channel speech enhancement system and a speech enhancement network model.
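As a rough illustration of the analysis and synthesis layers named in the abstract, the sketch below implements a windowed Fourier analysis as a strided 1-D convolution and its inverse as a transposed convolution (overlap-add), plus a single gated convolution. Everything here is an assumption made for illustration: the function names, the square-root Hann window, the 50% hop, the fixed (non-learned) Fourier kernels, and the sigmoid form of the gate are not stated in the listing, and the patent's actual layers may differ, for example by making the kernels trainable.

```python
import math


def fourier_kernels(win_len):
    """Build windowed cosine/sine kernels, one pair per frequency bin (assumed init)."""
    # Periodic square-root Hann window; with a hop of win_len // 2 its square sums to 1.
    window = [math.sqrt(0.5 - 0.5 * math.cos(2 * math.pi * n / win_len))
              for n in range(win_len)]
    bins = win_len // 2 + 1
    real = [[window[n] * math.cos(2 * math.pi * k * n / win_len)
             for n in range(win_len)] for k in range(bins)]
    imag = [[-window[n] * math.sin(2 * math.pi * k * n / win_len)
             for n in range(win_len)] for k in range(bins)]
    return window, real, imag


def analysis_layer(signal, real, imag, hop):
    """Strided 1-D convolution: every kernel slides over the waveform with stride hop."""
    win_len = len(real[0])
    frames = []
    for start in range(0, len(signal) - win_len + 1, hop):
        seg = signal[start:start + win_len]
        re = [sum(w * s for w, s in zip(kernel, seg)) for kernel in real]
        im = [sum(w * s for w, s in zip(kernel, seg)) for kernel in imag]
        frames.append((re, im))
    return frames


def synthesis_layer(frames, window, hop):
    """Transposed convolution: inverse real DFT per frame, then windowed overlap-add."""
    win_len = len(window)
    bins = win_len // 2 + 1
    out = [0.0] * (hop * (len(frames) - 1) + win_len)
    for t, (re, im) in enumerate(frames):
        for n in range(win_len):
            acc = 0.0
            for k in range(bins):
                # DC and Nyquist bins appear once; all other bins stand for a conjugate pair.
                scale = 1.0 if k in (0, win_len // 2) else 2.0
                angle = 2 * math.pi * k * n / win_len
                acc += scale * (re[k] * math.cos(angle) - im[k] * math.sin(angle))
            out[t * hop + n] += window[n] * acc / win_len
    return out


def gated_conv1d(x, w_feat, w_gate):
    """Gated 1-D convolution: a feature path scaled by a sigmoid gate path (assumed form)."""
    k = len(w_feat)
    out = []
    for i in range(len(x) - k + 1):
        seg = x[i:i + k]
        feat = sum(a * b for a, b in zip(w_feat, seg))
        gate = 1.0 / (1.0 + math.exp(-sum(a * b for a, b in zip(w_gate, seg))))
        out.append(feat * gate)
    return out


if __name__ == "__main__":
    win_len, hop = 16, 8
    window, real, imag = fourier_kernels(win_len)
    signal = [math.sin(0.3 * n) + 0.5 * math.cos(1.1 * n) for n in range(64)]
    frames = analysis_layer(signal, real, imag, hop)
    rebuilt = synthesis_layer(frames, window, hop)
    # Only samples covered by two overlapping frames are expected to be rebuilt exactly.
    interior = range(hop, len(signal) - hop)
    print(max(abs(rebuilt[n] - signal[n]) for n in interior))
```

With fixed Fourier kernels, the analysis and synthesis pair reproduces the interior of the waveform; in a full model the encoder, time convolution module and decoder would sit between the two calls and modify the frames before synthesis.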

Description

Technical field

[0001] The invention relates to the technical field of speech signal processing, and in particular to a single-channel speech enhancement method, a single-channel speech enhancement system and a speech enhancement network model.

Background art

[0002] Speech enhancement refers to the use of audio signal processing techniques and algorithms to improve the intelligibility or overall perceptual quality of distorted speech signals, thereby improving downstream applications such as speech recognition, voice calls, hearing aids, and voiceprint recognition. Traditional single-channel speech enhancement algorithms include spectral subtraction, algorithms based on statistical models, and subspace algorithms. However, such algorithms usually need to make certain assumptions about the respective characteristics of the speech signal and the noise, and about whether the two are correlated, which limits their enhancement performance. Most of the speech enhancement ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/02, G10L19/008, G10L19/02, G10L25/30
CPC: G10L19/008, G10L19/0212, G10L25/30
Inventor: 康洪涛, 欧阳鹏
Owner: BEIJING TSINGMICRO INTELLIGENT TECH CO LTD