Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for quickly recognizing speaker

A speaker recognition and speaker technology, applied in the field of speaker recognition, can solve the problems of large training samples, unbalanced positive and negative samples, and long training time.

Active Publication Date: 2012-09-12
JILIN UNIV
View PDF6 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the one-to-many implementation method is simple and intuitive, it needs to construct a small number of two-class support vector machines, but it uses all other class samples as negative samples, resulting in an imbalance of positive and negative samples, too large negative training samples, and longer training time
Especially when new categories are added, the negative samples change, and all support vector machines need to be retrained, resulting in poor scalability of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for quickly recognizing speaker
  • Method for quickly recognizing speaker
  • Method for quickly recognizing speaker

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] specific implementation plan

[0062] The present invention comprises the following steps:

[0063] (1) Preprocessing of speech signals, such as figure 1 As shown, the process includes: input the voice signal, perform pre-filtering, endpoint detection, pre-emphasis, and framing on it, and the pre-emphasis coefficient is 0.95; use Hamming window for framing, in which the window width is 256 sampling points, and the window shift is 128 a sampling point;

[0064] (2) Feature parameter extraction process, such as figure 2 As shown, the process consists of two steps:

[0065] (1) Extract the Mel-Frequency Cepstrum Coefficient (Mel-Frequency Cepstrum Coefficient) MFCC feature parameter as the first feature parameter of the speaker , for dimension matrix, is the number of preprocessed frames of speech data, is the dimension of the feature parameter;

[0066] (2) Using the first characteristic parameter Generate the GMM supervector as the characteristic paramet...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method for quickly recognizing a speaker and belongs to speaker recognition methods. The method comprises the following steps of: combining a Gaussian mixture model, and taking the supervector of the Gaussian mixture model as the feature parameter of the speaker; taking the supervector of the Gaussian mixture model as input, designing a one-class support vector machine classifier; and training N classifiers corresponding to N speakers, thus obtaining a voice sample of one speaker from one classifier. By utilizing the method, the speaker recognition speed is increased; for every new registered speaker, only one one-class support vector machine classifier is trained for the new speaker, so that the speaker recognition system has good extensibility.

Description

technical field [0001] The invention relates to a speaker recognition method. Background technique [0002] Speaker recognition, also known as voiceprint recognition, is a biometric technology that distinguishes speakers by voice for identification and authentication. At present, the speaker recognition method based on support vector has become a mainstream speaker recognition method. [0003] For the support vector machine, it is a two-class classifier. When it is applied to the speaker recognition system, it needs to complete the transformation from two classes to multi-class classification. The two-class support vector machine classifier can implement multi-class classification in two ways, one is a one-to-one implementation method. This method is a more commonly used multi-class classification method. The support vector machine multiclass classifier adopted in the patent CN1787075 and the patent CN102201237A is implemented in this way. For one-to-one multi-class clas...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/00G10L15/02G10L15/08G10L17/02G10L17/06G10L17/14
Inventor 林琳金焕梅陈虹姜宏孙晓颖陈建魏晓丽
Owner JILIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products