
Short Speech Speaker Recognition Method Based on Basis State Vector Weighting

A speaker recognition technology based on basis state vectors, applied in speech analysis, instruments, etc. It addresses limited speaker recognition performance, model holes, and insufficient adaptation of GMM mixture components. Its effects are overcoming model holes, improving recognition performance, and reducing the degrees of freedom of the model.

Active Publication Date: 2016-04-20
TSINGHUA UNIV
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0005] In current GSV-SVM speaker recognition systems, the training speaker's speech segment or the test speech segment is often short. As a result, some GMM mixture components cannot be fully adapted when the GMM mean supervector is adapted. This easily causes the "model hole" problem, which limits speaker recognition performance on short speech.
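The effect described in [0005] can be illustrated with a minimal sketch. It is not the patent's implementation: it assumes unit-variance Gaussian components, standard relevance-MAP adaptation of the means, and a toy 8-component UBM, and it reads "model hole" as a component that receives almost no frames and therefore stays at its UBM value. The function name `map_adapt_means`, the relevance factor of 16, and all data are hypothetical.

```python
import numpy as np

def map_adapt_means(frames, ubm_means, ubm_weights, relevance=16.0):
    """Relevance-MAP adaptation of GMM means (assumes unit-variance components)."""
    # Log-likelihood of every frame under every component, up to a constant.
    diff = frames[:, None, :] - ubm_means[None, :, :]           # (N, K, D)
    log_p = np.log(ubm_weights)[None, :] - 0.5 * np.sum(diff ** 2, axis=2)
    log_p -= log_p.max(axis=1, keepdims=True)                   # numerical stability
    post = np.exp(log_p)
    post /= post.sum(axis=1, keepdims=True)                     # frame posteriors

    occupancy = post.sum(axis=0)                                # soft frame count per component
    first_order = post.T @ frames                               # (K, D)
    data_means = first_order / np.maximum(occupancy, 1e-10)[:, None]

    # Components with little data get alpha near 0 and keep the UBM mean.
    alpha = occupancy / (occupancy + relevance)
    adapted = alpha[:, None] * data_means + (1.0 - alpha[:, None]) * ubm_means
    return adapted, occupancy

# Toy UBM: 8 well-separated components in 2 dimensions (hypothetical values).
rng = np.random.default_rng(0)
num_components, dim = 8, 2
ubm_means = 20.0 * np.arange(num_components)[:, None] * np.ones((1, dim))
ubm_weights = np.full(num_components, 1.0 / num_components)

# A short utterance: 30 frames that only excite components 0 and 1.
short_utterance = np.concatenate([
    ubm_means[0] + 2.0 + rng.normal(size=(15, dim)),
    ubm_means[1] + 2.0 + rng.normal(size=(15, dim)),
])

adapted_means, occupancy = map_adapt_means(short_utterance, ubm_means, ubm_weights)
holes = occupancy < 1.0   # components that saw essentially no data
print("components left unadapted:", int(holes.sum()), "of", num_components)
```

In this toy setting the unadapted blocks of the supervector carry no speaker information, yet they still enter any classifier trained on the full supervector, which is one way to picture why short segments hurt.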



Examples


Embodiment Construction

[0021] Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0022] Figure 1 is a flow chart of a short-speech speaker recognition method based on basis state vector weighting according to an embodiment of the present invention. Figure 4 is a schematic diagram of the same method. As shown in Figure 1, the short-speech speaker recognition method based on basis state vector weighting according to the embodiment of the present invention compri...



Abstract

The invention provides a short-speech speaker recognition method based on basis state vector weighting. The method comprises the following steps: acquiring multiple sets of speech data with text labels and training on them to obtain a state-layer-clustered hidden Markov model; decoding the speaker recognition data with the state-layer-clustered hidden Markov model to obtain basis state labels for the data; training a universal background model of the basis states according to those labels, and generating a basis state mean supervector and a basis state weight supervector from the model after MAP adaptation; and, according to the basis state mean supervector and the basis state weight supervector, training speaker models and testing to estimate the identity of the short-speech speaker. The method according to the embodiment of the invention enables fine-grained modeling at the basis state layer, and effective weighting overcomes the "model hole" problem that the traditional method easily causes, so that the degrees of freedom of the model are effectively reduced while speaker recognition performance is improved.
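The supervector step in the abstract can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented procedure: each basis state is modeled by a single background mean rather than a full background model, frames carry hard basis state labels (as would come from HMM decoding), the weight supervector is taken to be the normalized frame count per state, and the final combination (scaling each state's mean block by its weight) is one hypothetical choice. The names `basis_state_supervectors` and `weighted_supervector` are illustrative only.

```python
import numpy as np

def basis_state_supervectors(frames, state_ids, ubm_means, relevance=16.0):
    """Return (mean supervector, weight supervector) for one utterance.

    Assumption: one background mean per basis state, hard state labels,
    and at least one frame in the utterance.
    """
    num_states, dim = ubm_means.shape
    counts = np.bincount(state_ids, minlength=num_states).astype(float)

    # Sum the frames that were labeled with each basis state.
    sums = np.zeros((num_states, dim))
    np.add.at(sums, state_ids, frames)
    data_means = sums / np.maximum(counts, 1.0)[:, None]

    # Relevance-MAP adaptation: states with no frames keep the background mean.
    alpha = counts / (counts + relevance)
    adapted = alpha[:, None] * data_means + (1.0 - alpha[:, None]) * ubm_means

    mean_sv = adapted.reshape(-1)          # stacked adapted means
    weight_sv = counts / counts.sum()      # share of frames per basis state
    return mean_sv, weight_sv

def weighted_supervector(mean_sv, weight_sv, dim):
    """Hypothetical combination: scale each state's mean block by its weight."""
    return np.repeat(weight_sv, dim) * mean_sv

# Toy example: 4 basis states, 3-dimensional features (hypothetical values).
ubm_means = np.arange(12, dtype=float).reshape(4, 3)
frames = np.concatenate([np.ones((16, 3)), 2.0 * np.ones((16, 3))])
state_ids = np.array([0] * 16 + [2] * 16)   # states 1 and 3 are never observed

mean_sv, weight_sv = basis_state_supervectors(frames, state_ids, ubm_means)
combined = weighted_supervector(mean_sv, weight_sv, dim=3)
```

With this choice, blocks belonging to basis states that the short utterance never visited receive zero weight, so their unadapted background means do not contribute to the vector passed to the classifier.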

Description

Technical field

[0001] The invention relates to the technical field of speech recognition, in particular to a short-speech speaker recognition method based on basis state vector weighting.

Background technique

[0002] Speaker recognition technology is a biometric identification technology in which a machine automatically identifies the speaker's identity from the speech signal under test. It is widely used in voice-based speaker identification, public security criminal investigation, court evidence identification, national security and other fields.

[0003] Common speaker recognition systems mainly include VQ (Vector Quantization), GMM-UBM (Gaussian Mixture Model-Universal Background Model), GSV-SVM (Gaussian Mean Supervector-Support Vector Machine), JFA (Joint Factor Analysis), IVEC (identity vector) and so on. Among them, the GSV-SVM system is superior to other systems due to its flexibility and robustness, and is currently wid...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L17/04, G10L17/16
Inventor: 栗志意, 张卫强, 刘巍巍, 刘加
Owner: TSINGHUA UNIV