Speaker recognition system and method

A speaker recognition system and method that can solve problems of GMM-based modeling: the long training time of the GMM, the unclear physical meaning of the trained GMM, and the unclear role of each Gaussian component.

Inactive Publication Date: 2011-04-20
SONY CORP
Cites: 3; Cited by: 51

AI Technical Summary

Problems solved by technology

[0008] An important shortcoming of GMM-based modeling is that the physical meaning of the trained GMM is unclear: it is not evident which features contribute to each Gaussian component.
In addition, GMM training takes a long time because it requires utterances from a large number of speakers.


Embodiment Construction

[0040] Hereinafter, embodiments of the present invention will be described with reference to the drawings. It should be noted that, for the purpose of clarity, representations and descriptions of components and processes that are not related to the present invention and known to those of ordinary skill in the art are omitted from the drawings and descriptions.

[0041] First, the general working process of the speaker recognition system according to the embodiment of the present invention will be described with reference to the drawings, in particular Figures 1 to 4. As shown in Figure 1, the speaker recognition system according to the embodiment of the present invention includes: a feature extraction unit 101 configured to extract feature vectors from a speaker's voice data; a background model generation unit 103 configured to perform internal clustering on the feature vectors of the background speakers' voice data and generate a general background model for genera...
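The background model generation step can be illustrated with a minimal sketch. The excerpt does not say which clustering algorithm or model form the background model generation unit 103 uses, so plain k-means over feature vectors is an assumption made here for illustration only, and the function name `build_background_model` and the synthetic data are hypothetical.

```python
import numpy as np

def build_background_model(features, n_clusters, n_iters=20, seed=0):
    """Cluster background-speaker feature vectors with plain k-means.

    Assumption: k-means stands in for the "internal clustering" step,
    which the excerpt does not specify in detail.
    Returns (centroids, weights), where weights[k] is the fraction of
    frames assigned to cluster k.
    """
    rng = np.random.default_rng(seed)
    # Initialise centroids from distinct, randomly chosen frames.
    start = rng.choice(len(features), n_clusters, replace=False)
    centroids = features[start].astype(float)
    for _ in range(n_iters):
        # Squared Euclidean distance from every frame to every centroid.
        d2 = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for k in range(n_clusters):
            members = features[labels == k]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[k] = members.mean(axis=0)
    # Final assignment against the converged centroids.
    d2 = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    labels = d2.argmin(axis=1)
    weights = np.bincount(labels, minlength=n_clusters) / len(features)
    return centroids, weights

# Synthetic "background speaker" frames: two well-separated groups in 4-D.
rng = np.random.default_rng(1)
background = np.vstack([
    rng.normal(-3.0, 0.5, size=(200, 4)),
    rng.normal(3.0, 0.5, size=(200, 4)),
])
centroids, weights = build_background_model(background, n_clusters=2)
print(centroids.round(2))
print(weights)
```

Because each centroid is simply the mean of the frames assigned to it, its meaning stays easy to inspect, which is the kind of interpretability the summary says a trained GMM lacks.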


Abstract

The invention discloses a speaker recognition system and a speaker recognition method. The speaker recognition system comprises a feature extraction unit, a background model generation unit, a registered speaker model generation unit, a metric value calculation unit and a recognition unit. The feature extraction unit is configured to extract feature vectors from a speaker's speech data. The background model generation unit is configured to perform internal clustering on the feature vectors of the background speakers' speech data and to generate, from the clustering result, a universal background model for a general speaker. The registered speaker model generation unit is configured to adapt the universal background model using the feature vectors of each registered speaker's speech data, so as to generate a registered speaker model for each registered speaker. The metric value calculation unit is configured to calculate metric values of a tested speaker's feature vectors on the universal background model generated by the background model generation unit and on each registered speaker model generated by the registered speaker model generation unit. The recognition unit is configured to recognize the tested speaker according to the metric values calculated by the metric value calculation unit.
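The adaptation, metric and recognition units can be sketched as follows, taking a universal background model as given. The abstract does not state the adaptation rule, the metric or the decision rule, so everything specific here is an assumption for illustration: relevance-factor mean adaptation (a common MAP-style choice), a nearest-centroid distance score, and a thresholded margin over the background model. The names `adapt_model`, `metric`, `recognize`, `relevance` and `threshold` are hypothetical.

```python
import numpy as np

def nearest_sq_dist(feats, means):
    """Squared distance from each frame to every centroid, shape (frames, K)."""
    return ((feats[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)

def adapt_model(ubm_means, enroll_feats, relevance=16.0):
    """Shift each UBM centroid toward the enrollment frames assigned to it.

    Assumption: MAP-style mean adaptation with a relevance factor.
    """
    labels = nearest_sq_dist(enroll_feats, ubm_means).argmin(axis=1)
    adapted = ubm_means.astype(float)
    for k in range(len(ubm_means)):
        members = enroll_feats[labels == k]
        n = len(members)
        if n:
            alpha = n / (n + relevance)  # more data -> trust the speaker more
            adapted[k] = alpha * members.mean(axis=0) + (1 - alpha) * ubm_means[k]
    return adapted

def metric(feats, means):
    """Mean negative squared distance to the nearest centroid (higher fits better)."""
    return -nearest_sq_dist(feats, means).min(axis=1).mean()

def recognize(test_feats, ubm_means, speaker_models, threshold=0.0):
    """Return (best speaker or None, margins over the background model)."""
    ubm_score = metric(test_feats, ubm_means)
    margins = {name: metric(test_feats, m) - ubm_score
               for name, m in speaker_models.items()}
    best = max(margins, key=margins.get)
    return (best if margins[best] > threshold else None), margins

# Hand-made UBM and synthetic speakers (illustrative data only).
rng = np.random.default_rng(0)
ubm = np.array([[-3.0, -3.0], [3.0, 3.0]])

def frames(offset, n=150):
    """Draw n frames around each UBM centroid, shifted by a speaker offset."""
    return np.vstack([c + offset + rng.normal(0, 0.3, size=(n, 2)) for c in ubm])

models = {"alice": adapt_model(ubm, frames(0.8)),
          "bob": adapt_model(ubm, frames(-0.8))}

who, margins = recognize(frames(0.8, n=50), ubm, models)
stranger, _ = recognize(frames(0.0, n=50), ubm, models)
print(who, stranger)
```

Scoring each registered model relative to the background model, rather than in isolation, lets the sketch reject a tested speaker who fits no registered model better than the background model does.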

Description

Technical field

[0001] The present invention generally relates to speaker recognition systems and methods. More specifically, the present invention relates to a speaker recognition system and method based on a universal background model (UBM) and registered speaker models.

Background art

[0002] At present, the main biometric technologies under research in various countries include hand shape recognition, fingerprint recognition, facial recognition, voiceprint recognition, iris recognition, signature recognition, etc. Among these biometric characteristics, fingerprints, irises, faces and the like are exposed physical features, which criminals can easily exploit for impersonation by using violence against the person concerned. Voice features, by contrast, are internal physical features: as long as the person does not speak, they cannot be misappropriated. Therefore, in-depth research and development have been made in the field of ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L17/00, G10L15/00, G10L15/01
Inventors: 刘昆, 吴伟国
Owner: SONY CORP