Speech enhancement method in music background based on non-negative matrix factorization

A speech enhancement technique based on non-negative matrix factorization, applied in speech analysis, instruments, and related fields. It addresses problems such as unsatisfactory speech recognition performance, inadequate enhancement of the speech signal, and limited system application scenarios.

Status: Inactive
Publication Date: 2015-07-01
BEIJING INSTITUTE OF TECHNOLOGY
4 Cites, 8 Cited by

AI Technical Summary

Problems solved by technology

For the non-stationary noise that is most common in daily life, however, there is no good method for enhancing the speech signal.
As a result, speech recognition performs unsatisfactorily in real environments.
[0003] As a specific kind of background noise, music often appears together with the target speech signal, for example when answering the phone while music plays in a car, or in entertainment venues that play background music. The music contaminates the clean speech signal and lowers the recognition rate of automatic speech recognition systems in a music background.
However, because music is non-stationary and its spectral characteristics resemble those of speech, extracting the speech signal from a music background is very difficult.
[0004] Existing algorithms for extracting speech from background music have achieved good results, but most of them do not fully exploit prior information about the speech and background music signals, or consider only the sparsity of the speech signal.
In addition, most systems are speaker-dependent: they need to know the speaker's identity in advance to achieve their best results, which limits the scenarios in which they can be applied.

Embodiment Construction

[0036] The present invention is described in detail below with reference to the accompanying drawings and embodiments, together with the technical problems it solves and its beneficial effects. The described embodiments are intended only to facilitate understanding of the invention and do not limit it.

[0037] First, in many real scenes where speech and background music are mixed, the type of background music is known. For example, background music in broadcasting is generally soothing and light, while background music in entertainment venues is generally more dynamic and intense. To keep the system speaker-independent while maintaining its performance, a speaker-independent system that depends on the background music type is proposed as the purpos...

Abstract

The invention discloses a speech enhancement method in a music background based on non-negative matrix factorization, and belongs to the field of speech analysis or synthesis and audio analysis or processing. The method frames and windows a mixed signal of music and speech and applies non-negative matrix factorization to its short-time Fourier transform (STFT) magnitude spectrum. The basis matrix of the background music is obtained by training and is held fixed during the factorization. The magnitude spectrum of the speech signal is synthesized from the factorization result, and the enhanced speech signal is then recovered by combining it with the phase spectrum of the original mixed signal. Tests can be carried out under different speech sparsity constraints and temporal continuity constraints, and speech enhancement in a music background can be effectively improved by improving the temporal continuity constraint on the background music.
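
The pipeline in the abstract can be illustrated with a minimal NumPy sketch. It is not the patented implementation: the function names (`stft`, `istft`, `nmf`, `enhance`), the Hann window, the frame and hop sizes, the component counts, the KL-divergence multiplicative updates, and the L1 form of the speech sparsity penalty are all assumptions chosen for illustration. The temporal continuity constraint on the music activations is omitted to keep the sketch short.

```python
import numpy as np

EPS = 1e-10  # guards against division by zero

def stft(x, n_fft=512, hop=128):
    """Frame and Hann-window a 1-D signal, then FFT each frame -> (freq, frames)."""
    win = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * win
                       for i in range(n_frames)], axis=1)
    return np.fft.rfft(frames, axis=0)

def istft(X, n_fft=512, hop=128):
    """Overlap-add inverse of `stft`, normalised by the summed squared window."""
    win = np.hanning(n_fft)
    frames = np.fft.irfft(X, n=n_fft, axis=0) * win[:, None]
    n = n_fft + hop * (X.shape[1] - 1)
    out, norm = np.zeros(n), np.zeros(n)
    for t in range(X.shape[1]):
        out[t * hop:t * hop + n_fft] += frames[:, t]
        norm[t * hop:t * hop + n_fft] += win ** 2
    return out / np.maximum(norm, 1e-8)

def nmf(V, n_free, W_fixed=None, sparsity=0.0, n_iter=100, seed=0):
    """Factorise V ~= [W_fixed, W_free] @ H with KL multiplicative updates.

    Columns of W_fixed are never updated. `sparsity` is an L1 weight applied
    only to the activations of the free components (assumed penalty form).
    """
    rng = np.random.default_rng(seed)
    F, T = V.shape
    k_fix = 0 if W_fixed is None else W_fixed.shape[1]
    K = k_fix + n_free
    W = rng.random((F, K)) + EPS
    if k_fix:
        W[:, :k_fix] = W_fixed
    H = rng.random((K, T)) + EPS
    penalty = np.zeros((K, 1))
    penalty[k_fix:] = sparsity
    ones = np.ones_like(V)
    for _ in range(n_iter):
        # Update all activations; the penalty only affects the free rows.
        H *= (W.T @ (V / (W @ H + EPS))) / (W.T @ ones + penalty + EPS)
        # Update only the free basis columns; the fixed ones stay untouched.
        W_new = W * ((V / (W @ H + EPS)) @ H.T) / (ones @ H.T + EPS)
        W[:, k_fix:] = W_new[:, k_fix:]
        # Normalise free columns to unit sum and rescale H so W @ H is unchanged.
        scale = W[:, k_fix:].sum(axis=0, keepdims=True) + EPS
        W[:, k_fix:] /= scale
        H[k_fix:] *= scale.T
    return W, H

def enhance(mix, W_music, n_speech=20, sparsity=0.1):
    """Estimate speech from a music+speech mixture using a fixed music basis."""
    X = stft(mix)
    V, phase = np.abs(X), np.angle(X)
    W, H = nmf(V, n_speech, W_fixed=W_music, sparsity=sparsity)
    k = W_music.shape[1]
    V_speech = W[:, k:] @ H[k:]          # speech magnitude from the free part
    return istft(V_speech * np.exp(1j * phase))  # reuse the mixture phase
```

Under these assumptions, a music basis would first be trained on music-only audio with `W_music, _ = nmf(np.abs(stft(music)), n_free=30)`, and a mixture would then be processed with `speech_est = enhance(mix, W_music)`. Varying `sparsity` corresponds loosely to testing under different speech sparsity constraints.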

Description

Technical field

[0001] The invention relates to a speech enhancement method, in particular to a speech enhancement method in a music background that takes into account the sparsity of the speech signal and the temporal continuity constraint of the background music. It belongs to the field of speech analysis or synthesis and audio analysis or processing.

Background technique

[0002] Because of background noise interference, the performance of an automatic speech recognition system drops sharply as the signal-to-noise ratio decreases. Existing speech enhancement algorithms, such as spectral subtraction, adaptive filtering, and minimum mean square error estimation, enhance the speech signal well under stationary noise and thereby improve the recognition rate of automatic speech recognition systems under stationary noise. But for the most common non-stationary noise in daily life, there is no good solution to enhance the speech signal in such noise environme...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G10L21/0232
Inventor: 谢湘, 屠明
Owner: BEIJING INSTITUTE OF TECHNOLOGY