Speaker identification method based on a simple direct metric learning algorithm

A metric learning based speaker recognition technique, applied in the field of speaker recognition, that addresses the large correlation and redundancy in high-dimensional data and offers easy acquisition, good recognition performance, and fast speed.

Status: Inactive. Publication Date: 2016-09-07

JIANGXI NORMAL UNIV

1 Cites, 9 Cited by

AI Technical Summary

Problems solved by technology

As the dimension of the data increases, high-dimensional data often exhibit substantial correlation and redundancy.

Embodiment Construction

[0043] A speaker recognition method based on a simple direct metric learning algorithm according to an embodiment of the present invention is described in detail below with reference to the accompanying drawings. Figure 1 shows a flowchart of an embodiment of the method, which includes the following steps:

[0044] In step S110, speech samples from a plurality of speakers are collected, and an i-vector is extracted from each sample;

[0045] In step S120, the i-vectors of all samples are channel-compensated using the LDA or WCCN method and then length-normalized to form a training sample set;

[0046] In step S130, a similar sample pair set and a non-similar sample pair set are constructed from the i-vectors of the training sample set and the speaker identities;

[0047] In step S140, the KISS algorithm is used to train on the similar sample pair set and the non-similar sample pair set to obtain the metric matr...
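
The training steps S120 to S140 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: synthetic random vectors stand in for real i-vectors, the LDA/WCCN channel compensation is omitted, and the metric matrix uses the commonly published KISS form (the inverse covariance of similar-pair differences minus the inverse covariance of non-similar-pair differences, clipped to be positive semidefinite), because the text of step S140 is truncated here. All function names and the `ridge` parameter are hypothetical.

```python
import itertools
import numpy as np

def length_normalize(X):
    """Scale each row (one i-vector per row) to unit Euclidean length."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    return X / np.maximum(norms, 1e-12)

def build_pairs(labels):
    """Split all index pairs by whether both samples share a speaker label."""
    similar, dissimilar = [], []
    for i, j in itertools.combinations(range(len(labels)), 2):
        (similar if labels[i] == labels[j] else dissimilar).append((i, j))
    return similar, dissimilar

def pair_covariance(X, pairs):
    """Covariance of the pairwise difference vectors x_i - x_j."""
    idx = np.asarray(pairs)
    diffs = X[idx[:, 0]] - X[idx[:, 1]]
    return diffs.T @ diffs / len(pairs)

def kiss_metric(X, similar, dissimilar, ridge=1e-6):
    """Assumed KISS form: M = inv(cov_similar) - inv(cov_dissimilar)."""
    d = X.shape[1]
    cov_s = pair_covariance(X, similar) + ridge * np.eye(d)      # ridge keeps inverses stable
    cov_d = pair_covariance(X, dissimilar) + ridge * np.eye(d)
    M = np.linalg.inv(cov_s) - np.linalg.inv(cov_d)
    M = (M + M.T) / 2                                            # enforce exact symmetry
    w, V = np.linalg.eigh(M)
    return (V * np.clip(w, 0.0, None)) @ V.T                     # clip negative eigenvalues

# Synthetic stand-in for i-vectors: 4 speakers, 20 samples each, 5 dimensions.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(4), 20)
centers = rng.normal(size=(4, 5))
X = length_normalize(centers[labels] + 0.1 * rng.normal(size=(80, 5)))

similar, dissimilar = build_pairs(labels)
M = kiss_metric(X, similar, dissimilar)
print(M.shape, len(similar), len(dissimilar))
```

Enumerating every pair grows quadratically with the number of samples, so a real training set would likely need pair subsampling; that choice is outside what the source describes.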

Abstract

The invention provides a speaker identification method based on a simple direct metric learning algorithm. The method comprises the following steps: acquiring voice samples of multiple speakers, extracting the i-vectors of all samples, performing channel compensation using the LDA or WCCN method, performing length normalization, and forming a training sample set; constructing a similar sample pair set and a non-similar sample pair set according to the i-vectors of the training sample set and the speaker identities; training on the similar sample pair set and the non-similar sample pair set with the KISS algorithm to obtain a metric matrix; and, for two new utterances, first extracting their i-vectors, performing channel compensation using the LDA or WCCN method and length normalization, then using the previously computed metric matrix to calculate the Mahalanobis distance between the two i-vectors and comparing it with a threshold to determine whether the two utterances belong to the same speaker. The resulting Mahalanobis metric matrix more faithfully reflects the similarities and differences in the sample space, thereby improving the performance of the speaker identification system.
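
The verification step described in the abstract might look like the sketch below. It is illustrative only: the identity matrix stands in for a learned metric matrix, the threshold value is arbitrary, channel compensation is skipped, and the function names are hypothetical. The distance computed is sqrt((x - y)^T M (x - y)) for length-normalized vectors x and y.

```python
import numpy as np

def length_normalize(x):
    """Scale a single vector to unit Euclidean length."""
    x = np.asarray(x, dtype=float)
    return x / max(np.linalg.norm(x), 1e-12)

def mahalanobis_distance(x, y, M):
    """Distance under metric matrix M; M is assumed positive semidefinite."""
    diff = x - y
    return float(np.sqrt(max(diff @ M @ diff, 0.0)))

def same_speaker(x, y, M, threshold):
    """Accept the pair as one speaker when the distance is within the threshold."""
    d = mahalanobis_distance(length_normalize(x), length_normalize(y), M)
    return d <= threshold

M = np.eye(2)  # placeholder for a trained metric matrix (assumption)
print(same_speaker([3.0, 4.0], [6.0, 8.0], M, threshold=1.0))   # same direction
print(same_speaker([3.0, 4.0], [-4.0, 3.0], M, threshold=1.0))  # orthogonal directions
```

In practice the threshold would be tuned on held-out data; the source does not state how it is chosen.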

Description

Technical field

[0001] The invention is a speaker recognition method based on a simple and direct metric learning algorithm, which can be widely used in speaker recognition, pattern recognition, metric learning, machine learning and other fields.

Background technique

[0002] Speaker recognition (SR), also known as voiceprint recognition, is a technology that identifies a speaker by processing and analyzing the speaker's voice. How to effectively measure the similarity between speech samples is one of the active research issues in speaker recognition. In pattern recognition there are many methods for measuring the similarity between samples; the more commonly used are distance scoring methods, such as cosine distance scoring and Mahalanobis distance scoring.

[0003] The cosine distance scoring method measures the similarity between samples by calculating the cosine value of the included angle in the inn...
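
As a small illustration of the cosine scoring named in the background, the sketch below computes the cosine of the angle between two vectors and checks a standard identity: for unit-length vectors, the squared Euclidean distance equals 2 - 2cos. The vectors are arbitrary toy values rather than real i-vectors, and `cosine_score` is a hypothetical helper name.

```python
import numpy as np

def cosine_score(x, y):
    """Cosine of the angle between x and y (higher means more similar)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))

a = np.array([1.0, 2.0, 2.0])
b = np.array([2.0, 1.0, 2.0])
score = cosine_score(a, b)

# After length normalization, Euclidean distance carries the same information.
a_unit = a / np.linalg.norm(a)
b_unit = b / np.linalg.norm(b)
sq_dist = float(np.sum((a_unit - b_unit) ** 2))

print(score, sq_dist, 2 - 2 * score)
```

The identity shows that, on length-normalized vectors, a Mahalanobis distance with an identity metric matrix ranks pairs the same way as cosine scoring; a learned metric matrix changes that ranking by weighting directions differently.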

Application Information

IPC(8): G10L17/00, G10L17/04

CPC: G10L17/00, G10L17/04

Inventor: 雷震春, 杨印根, 朱明华

Owner: JIANGXI NORMAL UNIV
