
Fast speech blind source separation method based on frequency point selection under binaural distance

A fast speech blind source separation technique, applied in the field of speech analysis, instruments, etc. It addresses three problems: with closely spaced microphones, the frequency point selection criterion and the separation algorithm for unselected frequency points cannot be applied directly; with widely spaced microphones, the extracted delay parameters (TDOA) are inaccurate; and, as a result, the separation algorithm for unselected frequency points does not work properly. The method achieves reduced computational complexity, reduced computing time, and ease of access.

Active Publication Date: 2017-05-10
SHANDONG UNIV

AI Technical Summary

Problems solved by technology

[0008] (1) With closely spaced microphones, the acoustic signal from a given sound source undergoes approximately the same attenuation and delay on its way to each of the two microphones, so the frequency point selection criterion based on this model and the separation algorithm for unselected frequency points cannot be applied directly.
[0009] (2) When the microphone spacing increases, spatial aliasing may occur at high frequency points, and the delay parameters (TDOA) extracted from the separation matrices at these frequency points are inaccurate, so the separation algorithm for unselected frequency points does not work properly.
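The spacing trade-off in (2) can be made concrete with the standard half-wavelength condition: the inter-microphone phase difference is unambiguous only below f = c / (2d). The following is a minimal sketch under that assumption. The function names, the speed of sound, and the example spacings are illustrative and do not come from the patent.

```python
# Illustrative sketch: relates microphone spacing to the spatial aliasing
# limit and to the largest possible TDOA. All names here are hypothetical.

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at about 20 degrees C


def spatial_aliasing_limit(mic_spacing_m, c=SPEED_OF_SOUND):
    """Frequency (Hz) below which the inter-microphone phase difference
    stays within (-pi, pi) for every arrival direction: f < c / (2 d)."""
    if mic_spacing_m <= 0:
        raise ValueError("microphone spacing must be positive")
    return c / (2.0 * mic_spacing_m)


def max_tdoa(mic_spacing_m, c=SPEED_OF_SOUND):
    """Largest time difference of arrival (s), reached for a source
    on the line through both microphones: d / c."""
    if mic_spacing_m <= 0:
        raise ValueError("microphone spacing must be positive")
    return mic_spacing_m / c


if __name__ == "__main__":
    for spacing in (0.02, 0.15, 0.50):  # example spacings in metres
        print(
            f"d = {spacing:.2f} m: aliasing above "
            f"{spatial_aliasing_limit(spacing):.0f} Hz, "
            f"max TDOA {max_tdoa(spacing) * 1e6:.0f} us"
        )
```

A small spacing pushes the aliasing limit far above the speech band but makes the delays at the two microphones nearly identical, which is the situation described in (1).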



Examples


Embodiment Construction

[0049] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0050] Blind source separation model

[0051] In a real environment, the signal received by a microphone contains not only the attenuated signal arriving over the direct path but also multipath reflections. The path that a signal takes from a sound source to a microphone is usually described by a mixing filter of finite length. Define s_i(n) as the signal from sound source i (1≤i≤M, where M is the number of sound sources), x_j(n) as the signal received by microphone j (1≤j≤N, where N is the number of microphones), a_ji(n) as the room impulse response from sound source i to microphone j, and L as the length of the room impulse response. Then x_j(n) is expressed as:

[0052] x_j(n) = \sum_{i=1}^{M} \sum_{l=0}^{L-1} a_{ji}(l) \, s_i(n-l)
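The convolutive mixing model defined in [0051] can be sketched directly: each microphone signal is the sum, over all sources, of the source convolved with its room impulse response. This is a plain-Python illustration, not code from the patent. The function name `convolutive_mix` and the toy impulse responses are assumptions, and the output is truncated to the source length.

```python
# Illustrative sketch of convolutive mixing:
#   x_j(n) = sum over i of sum over l of a_ji(l) * s_i(n - l)
# The function name and the toy data below are hypothetical.


def convolutive_mix(sources, impulse_responses):
    """Mix M sources into N microphone signals.

    sources: list of M equal-length sample lists, sources[i][n] = s_i(n)
    impulse_responses: N x M nested list, impulse_responses[j][i] is the
        list of filter taps a_ji(0), a_ji(1), ...
    Returns a list of N sample lists with the same length as the sources.
    """
    num_samples = len(sources[0])
    mixtures = []
    for filters_for_mic in impulse_responses:      # one row per microphone j
        x_j = [0.0] * num_samples
        for s_i, a_ji in zip(sources, filters_for_mic):  # one filter per source i
            for n in range(num_samples):
                acc = 0.0
                for lag, tap in enumerate(a_ji):
                    if n - lag < 0:                # samples before n = 0 are zero
                        break
                    acc += tap * s_i[n - lag]
                x_j[n] += acc
        mixtures.append(x_j)
    return mixtures


if __name__ == "__main__":
    # Two unit impulses as sources, two microphones, very short toy filters.
    s = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
    a = [[[1.0, 0.5], [0.3]],
         [[0.2], [1.0, 0.4]]]
    for j, x in enumerate(convolutive_mix(s, a)):
        print(f"microphone {j}: {x}")
```

In a real room each filter has far more taps than in this toy case, which is why the direct time-domain formulation becomes expensive.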

[0053] Under convolutive mixing, the mixing filter may be thousands of taps long. Using the time-domain blind source separation method, it is necessary to esti...


Abstract

The invention discloses a fast speech blind source separation method based on frequency point selection under binaural distance. First, two problems that arise when the determinant of the mixed-signal covariance matrix is used as the frequency point selection criterion are analyzed, and a solution is provided. A preliminary selection scheme picks a subset of frequency points on which frequency-domain ICA and permutation alignment are performed; a frequency point screening step then determines the finally selected and the unselected frequency points. On the selected frequency points, the amplitude ambiguity is resolved to complete the separation. The unselected frequency points are separated with a separation matrix constructed from relative mixing parameters, which are extracted from the separation matrices of the selected frequency points. This two-stage process of preliminary selection followed by screening selects the frequency points with relatively high separation performance. The separation algorithm for the unselected frequency points has three advantages: it is not limited by the microphone spacing, the frequency points used to extract the relative mixing parameters can be restricted to the frequency range free of spatial aliasing, and the relative attenuation parameter is not affected by the amplitude ambiguity.
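Two steps of the abstract can be sketched for the two-microphone case. The sketch rests on several assumptions that the visible text does not confirm. Frequency points are ranked by the determinant of the sample covariance matrix, with larger values preferred. The relative mixing parameters are taken to be one attenuation and one delay per source under an anechoic model. All function names are hypothetical. The patent's own corrections to the determinant criterion and its screening step are not reproduced here.

```python
import cmath

# Illustrative two-microphone sketch; not the patent's implementation.


def covariance_determinant(x1_frames, x2_frames):
    """Determinant of the 2x2 sample covariance of one frequency point,
    given its complex STFT values over time at microphones 1 and 2."""
    num_frames = len(x1_frames)
    r11 = sum(abs(a) ** 2 for a in x1_frames) / num_frames
    r22 = sum(abs(b) ** 2 for b in x2_frames) / num_frames
    r12 = sum(a * b.conjugate() for a, b in zip(x1_frames, x2_frames)) / num_frames
    return r11 * r22 - abs(r12) ** 2


def preselect_bins(stft_mic1, stft_mic2, num_selected):
    """Rank frequency points by covariance determinant (assumed: larger is
    better) and return the indices of the top ones plus all determinants."""
    dets = [covariance_determinant(x1, x2) for x1, x2 in zip(stft_mic1, stft_mic2)]
    ranked = sorted(range(len(dets)), key=lambda k: dets[k], reverse=True)
    return sorted(ranked[:num_selected]), dets


def separation_matrix_from_relative_params(freq_hz, attenuations, delays_s):
    """Invert the assumed anechoic mixing matrix
    H(f) = [[1, 1], [a1 e^{-j 2 pi f t1}, a2 e^{-j 2 pi f t2}]]."""
    h21 = attenuations[0] * cmath.exp(-2j * cmath.pi * freq_hz * delays_s[0])
    h22 = attenuations[1] * cmath.exp(-2j * cmath.pi * freq_hz * delays_s[1])
    det = h22 - h21
    if abs(det) < 1e-12:
        raise ValueError("sources are indistinguishable at this frequency")
    return [[h22 / det, -1.0 / det], [-h21 / det, 1.0 / det]]


if __name__ == "__main__":
    # Toy data: point 1 is fully coherent (one active source), so its
    # covariance matrix is rank one and its determinant vanishes.
    mic1 = [[1 + 0j, 0j], [1 + 0j, 1 + 0j], [2 + 0j, 0j]]
    mic2 = [[0j, 1 + 0j], [1 + 0j, 1 + 0j], [0j, 2 + 0j]]
    print(preselect_bins(mic1, mic2, num_selected=2))
    print(separation_matrix_from_relative_params(1000.0, (0.8, 1.2), (1e-4, -2e-4)))
```

A determinant near zero means the covariance matrix is close to rank one, which happens when a single source dominates the frequency point. That property is what makes the determinant usable as a selection measure. The separation matrix breaks down when both sources have nearly the same relative attenuation and delay, which matches the closely spaced microphone problem noted in [0008].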

Description

Technical field

[0001] The invention relates to a fast speech blind source separation method based on frequency point selection under binaural distance.

Background

[0002] The cocktail party problem asks how a listener can pick out a target speaker among many people talking at a noisy banquet, that is, how to separate specific sound sources, or all of them, given only the sound obtained by linearly mixing several unknown sources. Blind source separation (BSS) is a technique for recovering the original source signals from the mixed sound, and it is used to solve the cocktail party problem. The technique has received attention since the 1990s, and thousands of publications describe research in this area. BSS is widely used for signal separation in many fields, such as speech enhancement for speech recognition, communications, and the analysis of ECG and EEG signals.

[0003] BSS is a very challenging task...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L21/0232, G10L21/0272, G10L21/0208
CPC: G10L21/0208, G10L21/0232, G10L21/0272, G10L2021/02087
Inventor: 魏莹, 闫莉莉, 勾多多, 马彤, 韩凯琳
Owner: SHANDONG UNIV