Speaker clustering method for distributed microphones

A speaker clustering method based on distributed microphones, applicable to speech analysis, speech recognition, instruments, and related fields. It addresses problems such as interference, high algorithmic complexity, and an unknown number of sound sources.

Active Publication Date: 2011-05-25
北京华控智加科技有限公司

AI Technical Summary

Problems solved by technology

However, microphone array computation is sensitive to sampling errors between devices, so it imposes very strict synchronization requirements on the audio data. In an ordinary multi-person, multi-party conference scene, the number of sound sources, the positions of the microphones, and the acoustic environment of the room are all unknown. That is, sound data needs to be processed in scenarios where both temporal an...

Embodiment Construction

[0068] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0069] Referring to Figure 1, a speaker clustering method for distributed microphones comprises the following steps:

[0070] Step 1: preprocess the signals collected by the distributed microphones.

[0071] Referring to Figure 2, the multi-channel sound source signals obtained by the distributed microphones are first preprocessed: the signals are divided into frames and transformed with the fast Fourier transform (FFT), and endpoint detection is then performed to divide them into two classes, sound source signals and non-sound source signals. The purpose of endpoint detection is to separate speech from non-speech in a digital speech signal. Early methods based on energy and zero-crossing rate can accurately distinguish speech signals from noise. However, speech in...
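The framing and energy/zero-crossing endpoint detection described in [0071] can be sketched roughly as follows. This is a minimal illustration for a single channel, not the patent's implementation: the function names, frame length, hop size, and both thresholds are assumptions chosen for the example, and the FFT step is left out for brevity.

```python
def split_frames(signal, frame_len, hop):
    """Split a list of samples into overlapping frames (a trailing partial frame is dropped)."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_speech_frames(signal, frame_len=256, hop=128,
                         energy_thresh=0.01, zcr_thresh=0.5):
    """Label each frame True (sound source) or False (non-sound source).

    Both thresholds are illustrative assumptions, not values from the patent.
    """
    labels = []
    for frame in split_frames(signal, frame_len, hop):
        is_speech = (short_time_energy(frame) > energy_thresh
                     and zero_crossing_rate(frame) < zcr_thresh)
        labels.append(is_speech)
    return labels
```

In a multi-channel setting the same labelling would presumably be applied per channel or to a combined signal; the visible text does not say which, so that choice is left open here.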

Abstract

The invention relates to a speaker clustering method for distributed microphones, comprising the following steps: first, the signals acquired by the distributed microphones are preprocessed; next, a time delay estimation method is applied to each sound source signal segment to obtain the corresponding time delay estimation vector; erroneous data are then ruled out and speaker segmentation is performed; finally, speaker clustering is carried out on the segmentation result. The distributed microphones serve as the signal acquisition and output devices from which the time delay vector of each speech segment is computed, ruling out erroneous data improves the precision of the time delay estimates, and a clustering algorithm applied to the time delay vectors groups the speech segments by speaker identity. The devices are inexpensive and convenient to use, and the method can be applied to multi-person, multi-party dialogue scenes in complex acoustic environments.
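As a rough illustration of the idea in the abstract, the sketch below estimates a time delay vector for one segment and groups segments whose vectors are close. It rests on several assumptions: plain time-domain cross-correlation stands in for whatever delay estimator the patent uses, the greedy grouping stands in for its clustering algorithm, the step that rules out erroneous data is omitted, and all names and the `tol` and `max_lag` parameters are hypothetical.

```python
def estimate_delay(ref, other, max_lag):
    """Return the lag in [-max_lag, max_lag] (in samples) that maximizes
    sum(ref[i] * other[i + lag]). A positive lag means `other` lags `ref`."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(ref)):
            j = i + lag
            if 0 <= j < len(other):
                score += ref[i] * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_vector(channels, max_lag=20):
    """Delays of every channel relative to channel 0 for one segment."""
    ref = channels[0]
    return [estimate_delay(ref, ch, max_lag) for ch in channels[1:]]


def group_by_delay(vectors, tol=1):
    """Greedy grouping (an assumed stand-in for the clustering step): a segment
    joins the first cluster whose representative vector differs by at most
    `tol` samples in every component; otherwise it starts a new cluster."""
    representatives, labels = [], []
    for v in vectors:
        for k, rep in enumerate(representatives):
            if all(abs(a - b) <= tol for a, b in zip(v, rep)):
                labels.append(k)
                break
        else:
            representatives.append(v)
            labels.append(len(representatives) - 1)
    return labels
```

The intuition is that a speaker who stays in one place produces roughly the same delay pattern across the microphones, so segments with similar delay vectors likely come from the same speaker even when the microphone positions are unknown.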

Description

Technical field

[0001] The invention belongs to the field of speech technology, and in particular relates to a speaker clustering method for distributed microphones.

Background

[0002] With the continuous development of network and communication technology, existing multimedia, network and communication, and distributed processing technologies make multi-person, multi-party dialogue possible in complex acoustic environments. Traditional sound source input and recording equipment includes head-mounted microphones, omnidirectional and directional single microphones, microphone arrays, and so on. A single microphone is small and inexpensive, but it cannot deal with environmental noise or locate the sound source. A microphone array is composed of multiple microphones arranged according to specific geometric positions. Spatial signal...

Application Information

IPC(8): G10L17/00, G10L15/08, G10L19/02, G10L17/02, G10L19/022
Inventors: 杨毅, 刘加
Owner 北京华控智加科技有限公司