Unlock instant, AI-driven research and patent intelligence for your innovation.

Speaker identification method and system

A speaker recognition and speaker technology, applied in the field of speaker recognition methods and systems, can solve the problem of not considering the speaker label information in the training data, and achieve the effect of improving the recognition performance

Active Publication Date: 2015-02-11
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are certain deficiencies in the total variation factor analysis technique. On the one hand, the speaker’s annotation information in the training data is not considered in the training process of the total variation space; Can reflect the overall structure of the data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker identification method and system
  • Speaker identification method and system
  • Speaker identification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] figure 1 It is a block diagram of a speaker recognition algorithm based on neighborhood-preserving embedded factor analysis, which describes the core components of a speaker recognition algorithm based on neighborhood-preserving embedded factor analysis. It mainly consists of several parts: GMM mean supervector, principal component analysis ( PCA), neighborhood preserving embedding (NPE) factor analysis, support vector machine (SVM) modeling and scoring. figure 2 It is a detailed flowchart of speaker recognition based on an embodiment of neighborhood-preserving embedding factor analysis.

[0015] Combine below figure 1 as well as figure 2 The specific implementation manner of the embodiment of the present invention is described in further detail:

[0016] The training process of the neighborhood-preserving embedding space matrix includes the following steps:

[0017] 1) Feature extraction is performed on the training speech data of principal component analysis and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a speaker identification method which comprises the following steps that a neighbourhood preserving embedding space matrix is obtained through training; speaker identification is performed on the basis of the neighbourhood preserving embedding space matrix; the speaker identification based on the neighbourhood preserving embedding space matrix comprises the following steps that principal component analysis (PCA) is performed on a mean super vector X of each gaussian mixture model (GMM), and then a vector W is obtained through dimensionality reduction; the neighbourhood preserving embedding space matrix is used for mapping each vector W, and then a vector W' is obtained; the vector W' obtained by mapping is used as an input feature of a support vector machine (SVM), so as to perform back-end classification modeling; and grading is performed with the help of the SVM, and a speaker is identified in accordance with the grading result. According to the speaker identification method disclosed by the embodiment of the invention, a novel factor analysis technique based on neighborhood preserving embedding (NPE) is adopted, the defects of the existing gross variation factor analysis technique can be effectively overcome, and the speaker identification performance can be further improved.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, in particular, the invention relates to a speaker recognition method and system. Background technique [0002] Speaker recognition technology, in simple terms, is a technology that automatically distinguishes speakers based on voice, so as to identify and authenticate speakers. Speaker recognition has always been of great importance in national security. In addition, with the development of communication and Internet technology, speaker recognition technology has also begun to be applied in multimedia information processing and retrieval. [0003] In the current laboratory environment, because the voice transmission channel is relatively single and the signal-to-noise ratio is high, in this case, the speaker recognition system can achieve good recognition performance. However, in practical applications, the complex and changeable speech environment, such as environmental noise and ch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/12
Inventor 周若华颜永红梁春燕杨琳
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI