Probability linear speaker-distinguishing identifying method based on priori knowledge structured covariance

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for speaker identification and linear identification, which is applied in the field of speaker identification based on probabilistic linear identification analysis based on prior knowledge normalization and covariance, which can solve problems such as erasure and achieve the effect of improving the effect.

Active Publication Date: 2015-12-09

SYSU CMU SHUNDE INT JOINT RES INST +1

View PDF4 Cites 14 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] However, the limitation of the above algorithm framework is that the frame length and signal-to-noise ratio of each speech are different, and the probabilistic linear discriminant analysis model trained by using the global covariance matrix to describe the residual distribution will obviously be different from the The real model has a certain deviation, and will erase the useful information inherent in each sentence that can help improve recognition performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0031] The drawings are for illustrative purposes only, and should not be construed as limitations on this patent; in order to better illustrate this embodiment, some parts in the drawings will be omitted, enlarged or reduced, and do not represent the size of the actual product;

[0032] For those skilled in the art, it is understandable that some well-known structures and descriptions thereof may be omitted in the drawings. The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0033] figure 1 In the present invention, the inherent physical information of the training speech, such as duration, signal-to-noise ratio, and scoring information obtained from other models, is used as a regularization process for the prior knowledge of this model training. In this embodiment, the duration of the training speech is selected as the prior knowledge. Covariance regularization.

[0034] figure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a probability linear speaker-distinguishing identifying method based on priori knowledge structured covariance, which is capable of structuring a covariance hypothesis and iterative process of a probability linear identifying-analyzing model based on random useful information related to training voice; and finally, a probability linear identifying-analyzing model that can be more distinctive and can reflect the real situation can be trained; and at the same time, two structuring coefficients are introduced to make the model adjustable and can be self-adaptive to be optimum aiming to various different structuring information. By adopting the model trained by this method provided herein, compared with the traditional model, the evaluating effect of identifying the speaker on the same dataset is improved significantly; and the equal error rate (EER) and the minimum detect error cost function (norm minDCF) can be lowered 10 percent to 20 percent in the evaluating database of identifying an internationally authoritative speaker.

Description

technical field [0001] The invention relates to the field of voiceprint recognition, in particular to a speaker recognition method based on prior knowledge regularization covariance probabilistic linear discrimination analysis. Background technique [0002] Speaker recognition technology is a technology that uses the speaker's characteristic information contained in the speech signal to make a judgment and identify the true identity behind it. Speaker recognition technology has been widely used in identification, video conferencing, access control, military criminal investigation and many other fields, and has developed into an increasingly important modern biometric authentication technology. In recent years, the speaker recognition method based on the total variation factor has become the mainstream method in the field of speaker recognition, which does not strictly distinguish between speakers and channels, and models them as a whole. Through this technology, the first-o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/02G10L17/04

Inventor 李明蔡炜城

Owner SYSU CMU SHUNDE INT JOINT RES INST

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Probability linear speaker-distinguishing identifying method based on priori knowledge structured covariance

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology