Speaker identification method based on deep stack autoencoder network
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- HUBEI UNIV OF TECH
- Publication Date
- 2019-02-15
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to the technical field of computer vision, in particular to a speaker recognition method based on a deep stack autoencoder network. Background technique
[0002] Speaker recognition, also known as voiceprint recognition, is a biometric authentication technology that uses specific speaker information contained in voice signals to identify the identity of the speaker. In recent years, the introduction of the identity vector (i-vector) speaker modeling method based on factor analysis has significantly improved the performance of the speaker recognition system. I-vector uses a low-dimensional total variable space to represent the speaker subspace and channel subspace, and maps the speaker's voice to this space to obtain a fixed-length vector representation (i.e., i-vector). The speaker recognition system based on i-vector mainly includes three steps: extraction of sufficient statistics, i-vector mapping, and calculation of likelihood...