Recording device clustering method based on Gaussian mean super vectors and spectral clustering

A technology of recording equipment and clustering method, applied in speech analysis, speech recognition, special data processing applications, etc., can solve the problems of recording equipment label information loss, lower recognition, equipment damage, etc.

Inactive Publication Date: 2017-07-14
SOUTH CHINA UNIV OF TECH
View PDF2 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In actual cases, the recognition result of the recording device may be affected due to factors such as loss of recording device label information, equipment damage, and uncertain device identification results, thereby reducing its recognition in court evidence collection; on the other hand, when a judge When faced with a large number of voice samples submitted, the primary concern may not be the category of the recording device, but to know which voice samples come from the same recording device
At this time, the problem that the judge has to face becomes: how to estimate the number of recording devices used to collect voice samples and combine the voice samples of the same recording device without knowing any prior information of the recording devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recording device clustering method based on Gaussian mean super vectors and spectral clustering
  • Recording device clustering method based on Gaussian mean super vectors and spectral clustering
  • Recording device clustering method based on Gaussian mean super vectors and spectral clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0099] figure 1 It is a structural flow chart of the present invention. Including the following steps:

[0100] 1. First, read in the voice sample recorded with the information of the recording device.

[0101] 2. Perform preprocessing on the read-in speech samples, including steps such as pre-emphasis, framing, and windowing of the speech signal. Preprocessing specifically includes the following steps:

[0102] 2.1. Pre-emphasis: set the transfer function of the digital filter to H(z)=1-αz -1 , where α is a coefficient and its value is: 0.9≤α≤1, and the read-in voice is pre-emphasized after passing through the digital filter;

[0103] 2.2, Framing: Set the frame length of the voice frame to 25 milliseconds, the frame shift to 10 milliseconds, and the number of sampling points corresponding to the frame length and frame shift to be N=0.025×f respectively s and S=0.01×f s , where f s For the voice sampling frequency, the read voice is divided into voice frames x t '(n),...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a recording device clustering method based on Gaussian mean super vectors and spectral clustering. The method comprises the steps that the Melch frequency cepstrum coefficient MFCC characteristic which characterizes the recording device characteristic is extracted from a speech sample; the MFCC characteristics of all speech samples are used as input, and a common background model UBM is trained through an expectation maximization EM algorithm; the MFCC characteristic of each speech sample is used as input, and UBM parameters are updated through a maximum posteriori probability MAP algorithm to acquire the Gaussian mixture model GMM of each speech sample; the mean vector of all Gaussian components of each GMM is spliced in turn to form a Gaussian mean super vector; a spectral clustering algorithm is used to cluster the Gaussian mean super vectors of all speech samples; the number of recording devices is estimated; and the speech samples of the same recording device are merged. According to the invention, the speech samples collected by the same recording device can be found out without knowing the prior knowledge of the type, the number and the like of the recording devices, and the application scope of the method is wide.

Description

technical field [0001] The invention relates to the technical fields of intelligent voice signal processing, pattern recognition and audio forensics, in particular to a recording device clustering method based on Gaussian mean supervector and spectral clustering. Background technique [0002] With the development of voice forensics technology, the identification of recording equipment based on voice samples has achieved good results, which is of great significance in judicial forensics. Voice evidence collected by recording equipment has become one of the common forms of evidence, and has been submitted to courts or other law enforcement agencies in large numbers, playing an important role in solving cases. [0003] In actual cases, the recognition result of the recording device may be affected due to factors such as loss of recording device label information, equipment damage, and uncertain device identification results, thereby reducing its recognition in court evidence co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/04G10L25/24G10L25/45G10L25/51G06F17/30
Inventor 李艳雄张雪李先苦张聿晗
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products