Speaker clustering method for distributed microphone

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A clustering method and microphone technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as interference, high algorithm complexity, and unknown number of sound sources

Active Publication Date: 2011-05-25

北京华控智加科技有限公司

View PDF3 Cites 67 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the calculation of the microphone array system is sensitive to the sampling error between each device, so the synchronization requirements for audio data are very strict; in the ordinary multi-person multi-party conference scene, the number of sound sources is unknown, the position of the microphone is unknown, and the acoustic environment of the room is unknown. That is, sound data needs to be processed in scenarios where both temporal and spatial prior information are missing.

[0007] As a single microphone for traditional sound source input and recording equipment, it is cheap and simple in structure. The disadvantage is that it is susceptible to environmental interference and cannot locate the sound source; the traditional microphone array system has been extensively studied, and the main reason for not commercializing The hardware is expensive and the algorithm complexity is high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0068] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0069] refer to figure 1 , a speaker clustering method for distributed microphones, comprising the following steps:

[0070] The first step is to preprocess the signals collected by distributed microphones

[0071] refer to figure 2 , first preprocess the multi-channel sound source signals obtained by distributed microphones, first divide the multi-channel sound source signals into frames and perform fast Fourier transform (FFT) transformation, and then perform endpoint detection on the multi-channel sound source signals, and divide the signals into There are two types of sound source signals and non-sound source signals. The purpose of endpoint detection is to distinguish speech signals and non-speech signals from digital speech signals. Early methods based on energy and zero-crossing rates can accurately distinguish speech signals from noise. However, speech in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a speaker clustering method for a distributed microphone, which comprises the following steps: firstly performing pretreatment on signals acquired by the distributed microphone, further adopting the time delay estimation method for calculation against sound source signal fragments, getting a corresponding time delay estimation vector, then ruling out wrong data, performing speaker segmentation, and finally performing speaker clustering according to the speaker segmentation result. The distributed microphone is used as a signal acquisition and output device for calculating the time delay vector of the voice signal fragments, the time delay estimation precision is improved by ruling out the wrong data, and clustering algorithm is adopted for the time delay vector so as to respectively classify the voice signal fragments according to identities of speakers; furthermore, the device has the advantages of low price and convenience in use, and the speaker clustering method can be applied in a multi-person multi-party dialogue scene under a complex acoustic environment.

Description

technical field [0001] The invention belongs to the technical field of speech, and in particular relates to a speaker clustering method of distributed microphones. Background technique [0002] With the continuous development of network and communication technology, the use of existing multimedia technology, network and communication technology, distributed processing technology, etc. can realize multi-person and multi-party dialogue in complex acoustic environment scenarios. Traditional sound source input and recording equipment include head-mounted microphones, omnidirectional and directional single microphones, microphone arrays, etc. As a traditional sound source input and recording device, a single microphone has the advantages of small size and low price, but it does not have the ability to process environmental noise and locate the sound source; the microphone array is composed of multiple microphones arranged according to specific geometric positions. Spatial signal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/00G10L15/08G10L19/02G10L17/02G10L19/022

Inventor杨毅刘加

Owner北京华控智加科技有限公司

Speaker clustering method for distributed microphone

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology