Voiceprint identification method and system based on a Gaussian mixture model

A voiceprint recognition technology based on a Gaussian mixture model, applied in speech recognition, speech analysis, instruments, etc., that addresses the problem of randomly chosen initial model parameters degrading the recognition rate of the system
CN102324232A · Inactive · Publication Date: 2012-01-18 · LIAONING UNIVERSITY OF TECHNOLOGY

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Application (China)
Current Assignee / Owner: LIAONING UNIVERSITY OF TECHNOLOGY
Publication Date: 2012-01-18
Estimated Expiration: Not applicable · inactive patent

Abstract

The invention provides a voiceprint identification method based on a Gaussian mixture model, and a system thereof. The method comprises the following steps: voice signal acquisition; voice signal preprocessing; voice signal feature parameter extraction, employing Mel-frequency cepstral coefficients (MFCC), wherein the order of the MFCC is usually 12 to 16; model training, employing the EM algorithm to train a Gaussian mixture model (GMM) on the speaker's voice signal feature parameters, wherein the k-means algorithm is selected as the parameter initialization method of the model; and voiceprint identification, comparing the feature parameters of a collected voice signal to be identified against the established speaker voice models and deciding according to the maximum a posteriori probability rule: the speaker whose model gives the feature vector X to be identified the maximum posterior probability is identified as the speaker. Because the method employs a Gaussian mixture model based on probability statistics, it reflects well the distribution of a speaker's features in feature space, its probability density function is of a common form, and the model parameters are easy to estimate and train; the method has good identification performance and noise robustness.
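
The training and identification steps named in the abstract can be illustrated with a minimal sketch. It is not the patent's implementation: it assumes diagonal covariances, equal prior probabilities for all speakers (so the maximum a posteriori decision reduces to picking the model with the highest likelihood), uniform initial weights with a shared initial variance, and small synthetic 2-D vectors standing in for real 12 to 16 order MFCC frames. All function names (`kmeans`, `train_gmm`, `identify`, `synth`) and the speaker labels "A" and "B" are hypothetical.

```python
import math
import random

def kmeans(X, k, iters=20, seed=0):
    """Plain k-means; returns k centroids used only to initialise the GMM means."""
    centers = random.Random(seed).sample(X, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for x in X:
            nearest = min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centers[c])))
            groups[nearest].append(x)
        # Recompute each centroid; keep the old one if its cluster is empty.
        centers = [[sum(col) / len(g) for col in zip(*g)] if g else centers[j]
                   for j, g in enumerate(groups)]
    return centers

def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian."""
    return -0.5 * sum(math.log(2 * math.pi * v) + (a - m) ** 2 / v
                      for a, m, v in zip(x, mean, var))

def logsumexp(vals):
    m = max(vals)
    return m + math.log(sum(math.exp(v - m) for v in vals))

def component_logs(x, gmm):
    """log(w_j) + log N(x | mu_j, diag(var_j)) for every mixture component j."""
    return [math.log(w) + log_gauss(x, m, v) for w, m, v in zip(*gmm)]

def train_gmm(X, k, em_iters=30, var_floor=1e-3, seed=0):
    """EM for a diagonal GMM whose means start at the k-means centroids."""
    n, d = len(X), len(X[0])
    means = kmeans(X, k, seed=seed)
    # Assumption: uniform initial weights and the global variance for every component.
    gmean = [sum(col) / n for col in zip(*X)]
    gvar = [max(sum((a - m) ** 2 for a in col) / n, var_floor)
            for col, m in zip(zip(*X), gmean)]
    weights, variances = [1.0 / k] * k, [gvar[:] for _ in range(k)]
    for _ in range(em_iters):
        # E-step: responsibility of each component for each frame.
        resp = []
        for x in X:
            logs = component_logs(x, (weights, means, variances))
            norm = logsumexp(logs)
            resp.append([math.exp(l - norm) for l in logs])
        # M-step: re-estimate weights, means and (floored) variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            weights[j] = nj / n
            means[j] = [sum(r[j] * x[i] for r, x in zip(resp, X)) / nj for i in range(d)]
            variances[j] = [max(sum(r[j] * (x[i] - means[j][i]) ** 2
                                    for r, x in zip(resp, X)) / nj, var_floor)
                            for i in range(d)]
    return weights, means, variances

def avg_log_likelihood(X, gmm):
    """Mean per-frame log-likelihood of the frames X under one speaker model."""
    return sum(logsumexp(component_logs(x, gmm)) for x in X) / len(X)

def identify(X, models):
    """With equal speaker priors, the MAP decision is the highest-likelihood model."""
    return max(models, key=lambda name: avg_log_likelihood(X, models[name]))

def synth(centers, n, seed):
    """Synthetic stand-in for feature frames: Gaussian blobs around given centres."""
    rng = random.Random(seed)
    return [[rng.gauss(c, 0.5) for c in centers[i % len(centers)]] for i in range(n)]

# Two hypothetical speakers, each modelled by a 2-component GMM.
models = {
    "A": train_gmm(synth([(0, 0), (3, 3)], 200, seed=1), k=2),
    "B": train_gmm(synth([(-4, 4), (6, -2)], 200, seed=2), k=2),
}
print(identify(synth([(0, 0), (3, 3)], 50, seed=3), models))
```

In a real system the synthetic frames would be replaced by MFCC vectors extracted from preprocessed speech, and the number of mixture components would be chosen per application; neither detail is specified in the text available here.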

Description

Technical field

[0001] The invention belongs to the field of voice signal processing devices, and relates to a Gaussian mixture model-based voiceprint recognition method and system that identify a speaker from the speaker's voice signal.

Background art

[0002] In recent years, with the wide application of information processing and artificial intelligence technology and the urgent demand for fast, effective identity verification, traditional password authentication has gradually lost its dominance. In the field of biometrics, voice-based speaker identification technology has been favored by more and more people.

[0003] Because each person's vocal organs differ physiologically and each person acquires different speaking behaviors, pronunciation and speaking habits vary from person to person, which makes it possible to identify a person by voice. In addition to the advantages of no forgetting, no need to remember, and conveni...
