Voice separation method based on binaural sound source localization

A technology for sound source localization and voice separation, applied in voice analysis, positioning, and measurement devices. It addresses problems such as heavy computation, deviation from the true value, and large microphone array size, and improves accuracy.

Active Publication Date: 2015-03-25

SOUTHEAST UNIV

6 Cites, 35 Cited by



Problems solved by technology

[0003] At present, the mixing matrix estimated by blind source speech separation requires the peak points to be selected manually, which introduces error relative to the true value, and its operating conditions are difficult to satisfy in the binaural speech separation model.

Speech separation algorithms based on multi-microphone arrays, in turn, suffer from a large amount of computation and a large microphone array size.

Embodiment Construction

[0033] The present invention will be further described below in conjunction with the accompanying drawings.

[0034] The present invention first performs data training: the mean values of the ITD (Interaural Time Difference) and IID (Interaural Intensity Difference) for each azimuth are used as the localization cues for the sound source position, and an azimuth mapping model is established. During actual sound source localization, the input is a two-channel acoustic signal. The input signal is first transformed to the frequency domain, and the ITD and IID parameters of each frame are calculated. The ITD feature parameters are matched one by one against the azimuth feature model established by the training module, the azimuths are screened based on the Euclidean distance measure, and candidate azimuths are output. The Euclidean distance is then calculated between the IID feature parameters of the frame corresponding to all candidate azimuths and the IID feature para...
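The two-stage matching described in [0034] can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: it assumes one scalar ITD and one scalar IID per frame, estimates the ITD from the peak of an FFT-based cross-correlation, and picks the candidate with the smallest IID distance as the final azimuth. The function names, the `n_candidates` parameter, and the template arrays `itd_means` and `iid_means` are hypothetical.

```python
import numpy as np

def frame_itd_iid(left, right, fs, max_lag):
    """Estimate (ITD in seconds, IID in dB) for one frame.

    Assumption: ITD is taken from the cross-correlation peak and is
    positive when the right channel lags the left channel.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    nfft = 2 * len(left)  # zero-pad so the circular correlation is linear
    spec_l = np.fft.rfft(left, nfft)
    spec_r = np.fft.rfft(right, nfft)
    # xcorr[k] = sum_n left[n + k] * right[n]; negative k wraps to the end
    xcorr = np.fft.irfft(spec_l * np.conj(spec_r), nfft)
    lags = np.arange(-max_lag, max_lag + 1)
    best_lag = lags[np.argmax(xcorr[lags])]
    itd = -best_lag / fs
    eps = 1e-12  # guards against log of zero for silent frames
    iid = 10.0 * np.log10((np.sum(left**2) + eps) / (np.sum(right**2) + eps))
    return itd, iid

def localize_frame(itd, iid, itd_means, iid_means, n_candidates=3):
    """Return the index of the matched azimuth template.

    Stage 1 screens azimuths by ITD distance; stage 2 chooses among the
    candidates by IID distance (an assumed decision rule).
    """
    itd_means = np.asarray(itd_means, dtype=float)
    iid_means = np.asarray(iid_means, dtype=float)
    itd_dist = np.abs(itd_means - itd)  # Euclidean distance in one dimension
    candidates = np.argsort(itd_dist)[:n_candidates]
    iid_dist = np.abs(iid_means[candidates] - iid)
    return int(candidates[np.argmin(iid_dist)])

# Synthetic frame: the right channel is a delayed, attenuated copy of the left.
rng = np.random.default_rng(0)
source = rng.standard_normal(516)
left_frame = source[4:516]
right_frame = 0.5 * source[0:512]
itd, iid = frame_itd_iid(left_frame, right_frame, fs=16000, max_lag=16)
azimuth_index = localize_frame(
    itd, iid,
    itd_means=[-2.5e-4, 0.0, 2.5e-4],  # hypothetical trained ITD means
    iid_means=[-6.0, 0.0, 6.0],        # hypothetical trained IID means
    n_candidates=2,
)
```

Screening on ITD first keeps the IID comparison restricted to a few plausible azimuths, which is the role the paragraph assigns to the candidate list.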


Abstract

The invention discloses a voice separation method based on binaural sound source localization. Multiple sound sources are separated through data training, multiple-sound-source localization, and voice separation according to the sound source directions, yielding the separated speech of each sound source. The method simulates human hearing based on the cocktail party effect of the human ears: the number of sound sources and their directions can be located accurately, and an accurate mixing matrix is obtained from the located sound source directions. The voice separation process is then carried out with this matrix, which effectively improves the separation performance of the method.
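To make the idea of deriving a mixing matrix from located directions concrete, here is a minimal sketch under strong assumptions that are not stated in the source: an anechoic model in which each source reaches the right ear as a delayed and scaled copy of what reaches the left ear, with per-source delay and gain values (hypothetical `delays_s` and `gains`) looked up from the located azimuths. Separation is done per time-frequency bin with a pseudo-inverse, which is a common textbook choice and not necessarily the patent's procedure.

```python
import numpy as np

def mixing_matrix(freq_hz, delays_s, gains):
    """Build a 2 x N mixing matrix at one frequency.

    Row 0 is the left ear (reference, all ones); row 1 is the right ear,
    modelled as a scaled and delayed copy for each of the N sources.
    """
    delays_s = np.asarray(delays_s, dtype=float)
    gains = np.asarray(gains, dtype=float)
    right_row = gains * np.exp(-2j * np.pi * freq_hz * delays_s)
    return np.vstack([np.ones_like(right_row), right_row])

def separate_bin(x_left, x_right, A):
    """Least-squares source estimates for one time-frequency bin."""
    x = np.array([x_left, x_right], dtype=complex)
    return np.linalg.pinv(A) @ x

# Two sources on opposite sides of the head (illustrative values only).
A = mixing_matrix(500.0, delays_s=[2.5e-4, -2.5e-4], gains=[0.5, 2.0])
true_sources = np.array([1.0 + 2.0j, -0.5 + 0.3j])
observed = A @ true_sources
estimated = separate_bin(observed[0], observed[1], A)
```

With two ears the pseudo-inverse recovers the sources exactly only when there are at most two of them and their right-ear entries differ; more sources would need an additional assumption such as sparsity in the time-frequency domain.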

Description

Technical field

[0001] The invention relates to speech separation technology, in particular to a speech separation method based on binaural sound source localization.

Background technique

[0002] Speech separation is a special kind of speech enhancement method. When the source speech signals and the transmission channel parameters (that is, the mixing process) are unknown, it recovers or separates the independent source speech signals based only on the observation data collected from the binaural microphones (that is, the mixed speech signal).

[0003] At present, the mixing matrix estimated by blind source speech separation requires the peak points to be selected manually, which introduces error relative to the true value, and its operating conditions are difficult to satisfy in the binaural speech separation model. However, the speech separation algorithm of multi-microphone array has problems such as large amount of calculation and large size of microphon...


Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L21/0308, G01S5/18

Inventor: 周琳, 李枭雄, 吴镇扬, 郭海燕

Owner: SOUTHEAST UNIV
