Method for identifying speaker unrelated to text based on weighted Bayes mixture model

A speaker recognition and hybrid model technology, applied in the field of speaker recognition, can solve the problems of low recognition accuracy, overfitting of training data, and no introduction of prior information.

Active Publication Date: 2014-12-03
INFORMATION & COMM BRANCH OF STATE GRID JIANGSU ELECTRIC POWER
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First of all, the traditional GMM training process is based on the maximum likelihood criterion, which is prone to overfitting or underfitting to the training data.
Second, traditional GMM-based text-independent speaker recognition only considers observational data without introducing prior information
The above problems often make the recognition accuracy of the traditional GMM-based text-independent speaker recognition system low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for identifying speaker unrelated to text based on weighted Bayes mixture model
  • Method for identifying speaker unrelated to text based on weighted Bayes mixture model
  • Method for identifying speaker unrelated to text based on weighted Bayes mixture model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The technical solutions of the present invention will be further elaborated below in conjunction with the accompanying drawings and embodiments.

[0057] Such as figure 1 As shown, the present invention provides a kind of text-independent speaker recognition method based on weighted Bayesian mixed model, and the method comprises the following steps:

[0058] The first step: preprocessing of the speech signal

[0059] (1) Sampling and quantization

[0060] For each segment of speech signal y in the data set used for training and used for recognition a (t) Sampling to obtain the amplitude sequence y(n) of the digital voice signal. The y(n) is quantized and coded by pulse code modulation (PCM) technology, so as to obtain the quantized value representation form y'(n) of the amplitude sequence. Here, the accuracy of sampling and quantization is determined according to the requirements of the speaker recognition system applied in different environments. For most speech s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for identifying a speaker unrelated to text based on a weighted Bayes mixture model. The method comprises that a voice signal set used for training is pre-processed and feature of the voice signal set is extracted, the training set is described via the weighted Bayes mixture model in the training process, parameter values and random variable distribution in the weighted Bayes mixture model are estimated via training, and thus, the weighted Bayes mixture model corresponding to each speaker is obtained. During identification, the marginal likelihood values of the trained weighted Bayes mixture models corresponding to the speakers are calculated via identification voices after preprocessing and feature extraction, and the maximal marginal likelihood corresponding to the speaker is used as an identification result. The method can effectively improve the correct identification rate of a text-related speaker identification system, avoids the problems of over-fitting and under-fitting that tend to occur in a traditional method, and enable that the relative weight of prior information and training data is easier and more flexible to control.

Description

technical field [0001] The invention relates to a text-independent speaker recognition method based on a weighted Bayesian mixture model, which belongs to the technical field of speech signal processing. Background technique [0002] Speaker recognition plays an increasingly important role in access control, credit card transactions, and court evidence. Its goal is to correctly determine the speech to be recognized as belonging to one of the multiple reference persons in the speech library. [0003] Currently, in the text-independent speaker recognition methods, the method based on Gaussian mixture model (ie: GMM) is the most widely used. Because of its advantages of high recognition rate, simple training, and small requirement for training data, it has become the mainstream method for speaker recognition that has nothing to do with text. Since GMM has a good ability to represent the distribution of data, as long as there are enough states and enough training data, GMM can ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L17/04G10L17/02
Inventor 魏昕周亮赵力陈建新
Owner INFORMATION & COMM BRANCH OF STATE GRID JIANGSU ELECTRIC POWER
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products