Speech enhancement method based on Kullback-Leibler divergence

A speech enhancement technology based on the Kullback-Leibler (KL) divergence, applied in wireless telephone communication, scene recording and military wiretapping. It addresses poor noise reduction performance and the reliance on empirically selected parameters, and it reduces the amount of computation.

Active Publication Date: 2019-02-15
SHANGHAI UNIV
13 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0007] To address the deficiencies of the prior art, the purpose of the present invention is to propose a speech enhancement method based on the Kullback-Leibler divergence and to determine the optimal number of decompositions for the method. It overcomes two shortcomings of traditional methods: poor noise reduction performance at low signal-to-noise ratio, and too many parameters that depend on empirical selection. The method significantly improves noise reduction performance at low signal-to-noise ratio, selects parameters adaptively without relying on human judgment, and selects the optimal number of decompositions, which reduces the amount of computation.

Examples

Embodiment Construction

[0022] For a better understanding of the technical solution of the present invention, it is described in further detail below with reference to the accompanying drawings:

[0023] The procedure of the method is shown in Figure 1. The speech enhancement method based on the Kullback-Leibler (KL) divergence uses the KL divergence selection principle to select, in each decomposition, the atom with modulus less than 1 that minimizes the KL divergence value. Rational orthogonal basis functions are constructed from the selected atoms, and the clean speech signal is then reconstructed by combining the basis functions with the weight coefficients, which completes the speech enhancement. In addition, the optimal number of decompositions is selected according to a cost function. The specific implementation steps are as follows:

[0024] 1) The original speech signal is divided into frames, each about 20-30 ms long, and the signal in this ...
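
The framing in step 1 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the 16 kHz sample rate, the 25 ms frame length, the non-overlapping layout and the function name `split_into_frames` are all assumptions, since the source only states that frames are about 20-30 ms long.

```python
import numpy as np

def split_into_frames(signal, sample_rate, frame_ms=25.0):
    """Split a 1-D signal into consecutive, non-overlapping frames.

    frame_ms and the non-overlapping layout are assumptions for illustration;
    trailing samples that do not fill a whole frame are dropped.
    """
    signal = np.asarray(signal, dtype=float)
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    if frame_len <= 0:
        raise ValueError("frame length must be at least one sample")
    n_frames = len(signal) // frame_len
    # One row per frame, so each frame can be processed separately.
    return signal[: n_frames * frame_len].reshape(n_frames, frame_len)

# Example: one second of a synthetic tone at an assumed 16 kHz sample rate.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)
frames = split_into_frames(tone, sample_rate)
```

Each row of `frames` holds one frame, which matches the per-frame processing described in the abstract. Practical systems often use overlapping, windowed frames instead; the source does not say which layout the method uses.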

Abstract

The invention discloses a speech enhancement method based on Kullback-Leibler (KL) divergence and determines the optimal number of decompositions for the method. The method proceeds as follows: first, the noisy speech signal is divided into frames, each frame is processed separately, and the optimal atoms a1, a2, ..., ak are selected by the KL divergence principle; second, the best rational orthogonal basis Bk is constructed from the atoms and combined with a weight coefficient to obtain the reconstructed signals f̂k; third, the reconstructed signals obtained from N decompositions are superposed to obtain the final denoised speech signal; and last, the RMSE attenuation difference is taken as a cost function to determine the optimal number of decompositions. The method has two advantages. First, the KL divergence selection principle adaptively selects the best atoms, from which the basis functions are constructed; this greatly reduces the uncertainty caused by manual parameter selection and gives better noise reduction performance at low SNR. Second, the optimal number of decompositions is determined from the cost function, which effectively reduces computational complexity. The method can be widely applied in fields such as speech noise reduction.
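
The basis construction, superposition and stopping steps can be sketched in Python under stated assumptions. The abstract does not give the formula for the rational orthogonal basis, so this sketch assumes the Takenaka-Malmquist system, a standard rational orthonormal system built from points inside the unit disc. The atoms are taken as given (the KL-based selection is not reproduced), the signal is treated as already sampled on the unit circle, and the rule in `choose_decomposition_count` with its tolerance `tol` is only one plausible reading of "RMSE attenuation difference". All function names are hypothetical.

```python
import numpy as np

def rational_orthogonal_basis(atoms, n_samples=1024):
    """Sample the Takenaka-Malmquist basis B_1..B_K on the unit circle.

    atoms: complex numbers with modulus < 1 (assumed already selected).
    Returns an array of shape (K, n_samples).
    """
    atoms = np.asarray(atoms, dtype=complex)
    if np.any(np.abs(atoms) >= 1):
        raise ValueError("every atom must have modulus less than 1")
    z = np.exp(1j * 2 * np.pi * np.arange(n_samples) / n_samples)
    basis = np.empty((len(atoms), n_samples), dtype=complex)
    blaschke = np.ones(n_samples, dtype=complex)  # product over earlier atoms
    for k, a in enumerate(atoms):
        basis[k] = np.sqrt(1 - abs(a) ** 2) / (1 - np.conj(a) * z) * blaschke
        blaschke = blaschke * (z - a) / (1 - np.conj(a) * z)
    return basis

def partial_reconstructions(signal, basis):
    """Row k holds the superposition of the first k+1 weighted basis functions."""
    # Weight coefficients as discrete inner products <signal, B_j>.
    coeffs = basis.conj() @ signal / signal.shape[0]
    return np.cumsum(coeffs[:, None] * basis, axis=0)

def choose_decomposition_count(rmse_per_step, tol=1e-3):
    """Smallest k such that adding term k+1 lowers the RMSE by less than tol.

    rmse_per_step[i] is the RMSE after i+1 decompositions (assumed given).
    """
    drops = -np.diff(rmse_per_step)
    small = np.nonzero(drops < tol)[0]
    return int(small[0]) + 1 if small.size else len(rmse_per_step)

# Illustrative atoms only; in the method they would come from KL-based selection.
atoms = [0.3 + 0.2j, -0.5j, 0.1]
basis = rational_orthogonal_basis(atoms)
gram = basis @ basis.conj().T / basis.shape[1]  # close to the identity matrix

# A toy signal in the span of the basis is recovered by the last partial sum.
signal = 2 * basis[0] + (1 - 1j) * basis[2]
recon = partial_reconstructions(signal, basis)

# Hypothetical RMSE values after 1, 2, 3 and 4 decompositions.
best_n = choose_decomposition_count([0.5, 0.3, 0.2999, 0.2998])
```

The Gram matrix check shows why an orthogonal basis is convenient here: each weight coefficient is a single inner product, and adding one more decomposition never changes the earlier terms, so the partial sums can be compared step by step when choosing the number of decompositions.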

Description

Technical field

[0001] The invention relates to a speech enhancement method based on Kullback-Leibler (KL) divergence, which is applied in technical fields such as wireless telephone communication, scene recording and military wiretapping.

Background

[0002] The purpose of speech signal processing is to obtain certain speech characteristic parameters for efficient transmission or storage, or to achieve a specific goal through a processing operation, such as speech synthesis, speaker identification, or recognition of speech content. Speech enhancement is an important aspect of speech signal processing. One of its main purposes is to extract the clean original speech signal, as far as possible, from a speech signal mixed with noise. Recovering a completely clean speech signal is almost impossible, especially at low signal-to-noise ratio. In this case, there are two main purposes of speech enhancement: one is to improve speech...

Application Information

IPC(8): G10L21/0208, G10L25/27
CPC: G10L21/0208, G10L25/27, Y02T90/00
Inventors: 王慧, 黄青华, 张丽丽, 柯晨光
Owner: SHANGHAI UNIV