Speaker recognition system and method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speaker recognition and speaker technology, applied in the field of speaker recognition system, can solve the problems of long training time of GMM, unclear physical meaning of GMM, unclear Gaussian component, etc.

Inactive Publication Date: 2011-04-20

SONY CORP

View PDF3 Cites 51 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] An important shortcoming of GMM-based modeling is that the physical meaning of the trained GMM is very unclear, that is, it is not clear which features contribute to each Gaussian component.

In addition, GMM training takes a long time due to the need for a large number of speakers' utterances

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0040] Hereinafter, embodiments of the present invention will be described with reference to the drawings. It should be noted that, for the purpose of clarity, representations and descriptions of components and processes that are not related to the present invention and known to those of ordinary skill in the art are omitted from the drawings and descriptions.

[0041] First, reference will be made to the drawings, especially Figure 1 to Figure 4 , Describe the general working process of the speaker recognition system according to the embodiment of the present invention. Such as figure 1 As shown, the speaker recognition system according to the embodiment of the present invention includes: a feature extraction unit 101 configured to extract a feature vector of the speaker’s voice data; a background model generation unit 103 configured to analyze a feature vector of the background speaker’s voice data Perform internal clustering and generate a general background model for genera...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speaker recognition system and a speaker recognition method. The speaker recognition system comprises a characteristic extraction unit, a background model generation unit, a registered speaker model generation unit, a metric value calculation unit and a recognition unit, wherein the characteristic extraction unit is configured to extract a characteristic vector of speech data of a speaker; the background model generation unit is configured to perform internal clustering on the characteristic vector of the speech data of a background speaker and generate a universal background model aiming at a normal speaker according to the result of the internal clustering; the registered speaker model generation unit is configured to adapt to the universal background model by using the characteristic vector of the speech data of each registered speaker so as to generate a registered speaker model of each registered speaker; the metric value calculation unit is configured to calculate metric values of the characteristic vector of a tested speaker on the universal background model generated by the background model generation unit and on the registered speaker model of each registered speaker, which is generated by the registered speaker model generation model; and the recognition unit is configured to recognize the tested speaker according to the metric values calculated by the metric value calculation unit.

Description

Technical field [0001] The present invention generally relates to speaker recognition systems and methods. More specifically, the present invention relates to a specific speaker recognition system and method based on a universal background model (UBM) and a registered speaker model. Background technique [0002] At present, the main research biometric technology in various countries includes hand shape recognition, fingerprint recognition, facial recognition, voiceprint recognition, iris recognition, signature recognition, etc. Among these biological characteristics, fingerprints, iris, face, etc. are all exposed physical characteristics, which can easily be impersonated by criminals using the physical characteristics of the person involved in the case of violence. Human voice features are built-in physical features. As long as the person does not speak, there is no possibility of being misappropriated. Therefore, in-depth research and development have been made in the field of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L17/00G10L15/00G10L15/01

Inventor刘昆吴伟国

OwnerSONY CORP

Speaker recognition system and method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology