Speaker recognition method under linear transformation of identity vector x-vector

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
An xl-vector, speaker recognition technology, applied in the field of speaker recognition, can solve the problems of recognition environment impact, large memory requirements, computing speed impact, etc., to achieve the effect of improving recognition performance

Active Publication Date: 2019-07-23

DONGHUA UNIV

View PDF9 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Different papers have studied how to improve the system performance under the x-vector model. Studies have shown that superposition of i-vector and x-vector models or fusion of PLDA scores can improve system performance. However, this method is designed for two systems and requires a lot of The memory requirements, while the calculation speed will also be affected

Subsequently, more research improved the robustness of x-vector through data expansion, but this method is affected by the recognition environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0027] Below in conjunction with specific embodiment, further illustrate the present invention. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. In addition, it should be understood that after reading the teachings of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of the present application.

[0028] The embodiment of the present invention discloses a method of speaker recognition technology under the linear transformation of the identity vector x-vector, such as figure 1 shown, including the following steps:

[0029] Step 1. Feature extraction—the present invention uses Mel Frequency Cepstral Coefficients (MFCC) as the feature of the speaker. The Mel frequency scale roughly corresponds to the logarithmic distribut...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a speaker recognition method under linear transformation of an identity vector x-vector. The method comprises the main steps that feature extraction is carried out on speech,and the identity vector x-vector and identity vector i-vector of the speech are extracted respectively; parallel factor analyzer training is carried out by using the x-vector and i-vector of a same speaker; a parameter corresponding to the x-vector in a parallel factor analyzer is selected, and on the basis of the parameter, linear transformation is carried out on the identity vector x-vector to obtain an xl-vector; the new identity vector xl-vector is trained to obtain a PLD model; feature extraction and x-vector extraction are carried out on the to-be-tested speech, the identity vector x-vector is input into a linear transducer obtained in the training stage to obtain the new identity vector xl-vector, and finally, the identity vector xl-vector is input into the PLD model obtained in thetraining stage, thereby obtaining a final result. The speaker recognition method under the linear transformation of the identity vector x-vector has the advantages of improving the recognition performance of speaker recognition while ensuring that the memory requirements and computing speed are similar to those of a baseline system.

Description

technical field [0001] The present invention relates to speaker identification technology in biometric identification, and more specifically relates to a speaker identification technology under linear transformation of identity vector x-vector. Background technique [0002] Voice is the most direct and convenient way for human beings to communicate. It has attracted the attention of various research institutions for its unique convenience, economy, accuracy and other advantages. The research on speech signal processing is of great significance to the promotion of human-computer interaction and the development of artificial intelligence. For this reason, the related fields of speech signal processing, such as speech recognition, speech coding, speech synthesis, speaker recognition, etc., have received more and more attention and theoretical research. Speaker recognition, also known as voiceprint recognition, aims to authenticate the identity of each speaker based on the uniq...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L25/18G10L25/24G10L25/30G10L25/60G10L15/02G10L15/06

CPCG10L15/02G10L15/063G10L25/18G10L25/24G10L25/30G10L25/60G10L2015/0635

Inventor徐珑婷张光林赵萍张磊季云云

OwnerDONGHUA UNIV

Speaker recognition method under linear transformation of identity vector x-vector

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology