Recording device clustering method based on Gaussian mean super vectors and spectral clustering

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of recording equipment and clustering method, applied in speech analysis, speech recognition, special data processing applications, etc., can solve the problems of recording equipment label information loss, lower recognition, equipment damage, etc.

Inactive Publication Date: 2017-07-14

SOUTH CHINA UNIV OF TECH

View PDF2 Cites 31 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] In actual cases, the recognition result of the recording device may be affected due to factors such as loss of recording device label information, equipment damage, and uncertain device identification results, thereby reducing its recognition in court evidence collection; on the other hand, when a judge When faced with a large number of voice samples submitted, the primary concern may not be the category of the recording device, but to know which voice samples come from the same recording device

At this time, the problem that the judge has to face becomes: how to estimate the number of recording devices used to collect voice samples and combine the voice samples of the same recording device without knowing any prior information of the recording devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0099] figure 1 It is a structural flow chart of the present invention. Including the following steps:

[0100] 1. First, read in the voice sample recorded with the information of the recording device.

[0101] 2. Perform preprocessing on the read-in speech samples, including steps such as pre-emphasis, framing, and windowing of the speech signal. Preprocessing specifically includes the following steps:

[0102] 2.1. Pre-emphasis: set the transfer function of the digital filter to H(z)=1-αz -1 , where α is a coefficient and its value is: 0.9≤α≤1, and the read-in voice is pre-emphasized after passing through the digital filter;

[0103] 2.2, Framing: Set the frame length of the voice frame to 25 milliseconds, the frame shift to 10 milliseconds, and the number of sampling points corresponding to the frame length and frame shift to be N=0.025×f respectively s and S=0.01×f s , where f s For the voice sampling frequency, the read voice is divided into voice frames x t '(n),...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a recording device clustering method based on Gaussian mean super vectors and spectral clustering. The method comprises the steps that the Melch frequency cepstrum coefficient MFCC characteristic which characterizes the recording device characteristic is extracted from a speech sample; the MFCC characteristics of all speech samples are used as input, and a common background model UBM is trained through an expectation maximization EM algorithm; the MFCC characteristic of each speech sample is used as input, and UBM parameters are updated through a maximum posteriori probability MAP algorithm to acquire the Gaussian mixture model GMM of each speech sample; the mean vector of all Gaussian components of each GMM is spliced in turn to form a Gaussian mean super vector; a spectral clustering algorithm is used to cluster the Gaussian mean super vectors of all speech samples; the number of recording devices is estimated; and the speech samples of the same recording device are merged. According to the invention, the speech samples collected by the same recording device can be found out without knowing the prior knowledge of the type, the number and the like of the recording devices, and the application scope of the method is wide.

Description

technical field [0001] The invention relates to the technical fields of intelligent voice signal processing, pattern recognition and audio forensics, in particular to a recording device clustering method based on Gaussian mean supervector and spectral clustering. Background technique [0002] With the development of voice forensics technology, the identification of recording equipment based on voice samples has achieved good results, which is of great significance in judicial forensics. Voice evidence collected by recording equipment has become one of the common forms of evidence, and has been submitted to courts or other law enforcement agencies in large numbers, playing an important role in solving cases. [0003] In actual cases, the recognition result of the recording device may be affected due to factors such as loss of recording device label information, equipment damage, and uncertain device identification results, thereby reducing its recognition in court evidence co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/04G10L25/24G10L25/45G10L25/51G06F17/30

Inventor 李艳雄张雪李先苦张聿晗

Owner SOUTH CHINA UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Recording device clustering method based on Gaussian mean super vectors and spectral clustering

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology