Method for identifying speaker unrelated to text based on weighted Bayes mixture model

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speaker recognition and hybrid model technology, applied in the field of speaker recognition, can solve the problems of low recognition accuracy, overfitting of training data, and no introduction of prior information.

Active Publication Date: 2014-12-03

INFORMATION & COMM BRANCH OF STATE GRID JIANGSU ELECTRIC POWER

View PDF3 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

First of all, the traditional GMM training process is based on the maximum likelihood criterion, which is prone to overfitting or underfitting to the training data.

Second, traditional GMM-based text-independent speaker recognition only considers observational data without introducing prior information

The above problems often make the recognition accuracy of the traditional GMM-based text-independent speaker recognition system low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0056] The technical solutions of the present invention will be further elaborated below in conjunction with the accompanying drawings and embodiments.

[0057] Such as figure 1 As shown, the present invention provides a kind of text-independent speaker recognition method based on weighted Bayesian mixed model, and the method comprises the following steps:

[0058] The first step: preprocessing of the speech signal

[0059] (1) Sampling and quantization

[0060] For each segment of speech signal y in the data set used for training and used for recognition a (t) Sampling to obtain the amplitude sequence y(n) of the digital voice signal. The y(n) is quantized and coded by pulse code modulation (PCM) technology, so as to obtain the quantized value representation form y'(n) of the amplitude sequence. Here, the accuracy of sampling and quantization is determined according to the requirements of the speaker recognition system applied in different environments. For most speech s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for identifying a speaker unrelated to text based on a weighted Bayes mixture model. The method comprises that a voice signal set used for training is pre-processed and feature of the voice signal set is extracted, the training set is described via the weighted Bayes mixture model in the training process, parameter values and random variable distribution in the weighted Bayes mixture model are estimated via training, and thus, the weighted Bayes mixture model corresponding to each speaker is obtained. During identification, the marginal likelihood values of the trained weighted Bayes mixture models corresponding to the speakers are calculated via identification voices after preprocessing and feature extraction, and the maximal marginal likelihood corresponding to the speaker is used as an identification result. The method can effectively improve the correct identification rate of a text-related speaker identification system, avoids the problems of over-fitting and under-fitting that tend to occur in a traditional method, and enable that the relative weight of prior information and training data is easier and more flexible to control.

Description

technical field [0001] The invention relates to a text-independent speaker recognition method based on a weighted Bayesian mixture model, which belongs to the technical field of speech signal processing. Background technique [0002] Speaker recognition plays an increasingly important role in access control, credit card transactions, and court evidence. Its goal is to correctly determine the speech to be recognized as belonging to one of the multiple reference persons in the speech library. [0003] Currently, in the text-independent speaker recognition methods, the method based on Gaussian mixture model (ie: GMM) is the most widely used. Because of its advantages of high recognition rate, simple training, and small requirement for training data, it has become the mainstream method for speaker recognition that has nothing to do with text. Since GMM has a good ability to represent the distribution of data, as long as there are enough states and enough training data, GMM can ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/04G10L17/02

Inventor魏昕周亮赵力陈建新

OwnerINFORMATION & COMM BRANCH OF STATE GRID JIANGSU ELECTRIC POWER

Method for identifying speaker unrelated to text based on weighted Bayes mixture model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology