Check patentability & draft patents in minutes with Patsnap Eureka AI!

Speech enhancement method and system

A technology of speech enhancement and noisy speech, applied in speech analysis, instruments, etc., can solve the problems of indistinguishability, poor enhancement effect, audio splicing distortion, etc., to achieve the effect of improving the enhancement effect and reducing the number of

Pending Publication Date: 2021-11-05
COMMUNICATION UNIVERSITY OF CHINA
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the way of speech enhancement based on the audio time-domain waveform, due to the dense sampling points of the time-domain signal, for long audio, it is difficult for the network to learn all the information of the entire audio, so it is necessary to divide the signal into frames, so that the network can Frame learning, and finally splicing each frame together, which will cause serious distortion at the audio splicing and poor enhancement effect
However, based on short-time Fourier transform processing, the signal needs to be a stationary signal. For non-stationary signals, the frequency components of the signal are different at different times, which cannot be distinguished. Therefore, the scope of application of this method is limited. The enhancement effect of stationary signal is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method and system
  • Speech enhancement method and system
  • Speech enhancement method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0057] The terms "first" and "second" in the description and claims of the present invention and the above drawings are used to distinguish different objects, rather than to describe a specific order. Furthermore, the terms "comprising" and "having", and any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or apparatus comprising a series of steps or units is not defi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech enhancement method and system. The method comprises the following steps: acquiring a noisy speech signal; performing wavelet decomposition on the noisy voice signal to obtain a plurality of noisy sub-bands; inputting each sub-band with noise into a speech enhancement model to obtain an enhanced sub-band corresponding to each sub-band with noise; and performing wavelet synthesis on the plurality of enhancement sub-bands to obtain an enhanced voice signal. According to the invention, the length of the signal can be reduced layer by layer through discrete wavelet change, the number of sampling points is reduced, the method is more suitable for non-stationary signals such as voice, and the voice signal enhancement effect is improved.

Description

technical field [0001] The invention relates to the technical field of audio processing, in particular to a voice enhancement method and system. Background technique [0002] In practical applications, speech signals are easily disturbed by noise, and it is necessary to suppress noise interference through speech enhancement technology, reduce the impact of noise on speech, and extract useful speech signals from noisy speech. The current speech enhancement technology is mainly a speech enhancement method based on deep learning, that is, two kinds of audio features are used as the input of the network, one of which is based on the audio time-domain waveform for speech enhancement, and the other method is to perform speech enhancement first. Some signal preprocessing means such as short-time Fourier transform, and then do noise reduction processing. [0003] However, in the way of speech enhancement based on the audio time-domain waveform, due to the dense sampling points of t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/0208G10L21/0232G10L25/18G10L25/30G10L25/45
CPCG10L21/0208G10L21/0232G10L25/30G10L25/45G10L25/18
Inventor 王雨田王童王晖赵海博
Owner COMMUNICATION UNIVERSITY OF CHINA
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More