Speaker identification method based on deep stack autoencoder network

A technology of speaker recognition and self-encoding network, applied in speech analysis, instruments, etc., can solve problems such as limiting model performance, achieve the effects of improving recognition performance, reducing system performance impact, and improving robustness
CN109346084APending Publication Date: 2019-02-15HUBEI UNIV OF TECH

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
HUBEI UNIV OF TECH
Publication Date
2019-02-15

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to a speaker identification method based on the deep stack autoencoder network. The method comprises steps of S1, speaker feature extraction; S2, stack autoencoder network design; and S3: speaker identification and decision making. The method is advantaged in that compared with traditional speaker identification, the deep stack autoencoder network is fused with a speaker identification system model, in combination with the multi-layer structure of a stack autoencoder to improve the characterization ability of an evaluation model, system identification performance in the presence of background noise can be finitely improved, influence of the noise on the system performance is reduced, system noise robustness is improved, the system structure is optimized, and identification timeliness is effectively enhanced.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of computer vision, in particular to a speaker recognition method based on a deep stack autoencoder network. Background technique

[0002] Speaker recognition, also known as voiceprint recognition, is a biometric authentication technology that uses specific speaker information contained in voice signals to identify the identity of the speaker. In recent years, the introduction of the identity vector (i-vector) speaker modeling method based on factor analysis has significantly improved the performance of the speaker recognition system. I-vector uses a low-dimensional total variable space to represent the speaker subspace and channel subspace, and maps the speaker's voice to this space to obtain a fixed-length vector representation (i.e., i-vector). The speaker recognition system based on i-vector mainly includes three steps: extraction of sufficient statistics, i-vector mapping, and calculation of likelihood...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More