Voiceprint recognition method based on RNN

A technology of voiceprint recognition and subtraction method, which is applied in speech analysis, instruments, etc., can solve the problems of not considering the context correlation of speech data, not using deep learning to extract features, and features not being able to represent the speaker well, etc. To achieve the effect of improving the recognition performance

Active Publication Date: 2018-02-23
王华锋
View PDF14 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The technical problem solved by the present invention is: overcoming the context correlation of voice data that is not considered in the existing voiceprint recognition method, the extracted features cannot represent the speaker well, and the powerful feature extraction ability of deep learning has not been brought into play And other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voiceprint recognition method based on RNN
  • Voiceprint recognition method based on RNN
  • Voiceprint recognition method based on RNN

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] figure 1 The overall processing flow of the present invention is given. The present invention provides a method for voiceprint recognition based on RNN. The main steps are as follows: firstly, denoise the input voice data using spectral subtraction, and extract the MFCC characteristic parameters and first and second order differences of the obtained pure voice data by frame , and then concatenate the MFCC parameters and the first and second order differences of multiple consecutive frames to obtain a two-dimensional feature parameter matrix, which is used as the input of the cyclic neural network. The present invention uses the variant LSTM of the cyclic neural network for training, and the LSTM passes through a unique gate mechanism-a forget gate, an input gate and an output gate ( figure 1 The δ in is the sigmoid activation function, tanh is the hyperbolic tangent activation function) to retain the long-term information of the sequence, and thus complete the training...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voiceprint recognition method based on RNN. According to the voiceprint recognition method based on RNN, after MFCC features of denoised voice data and first-order and second-order differences are obtained, the advanced features of a speaker in the MFCC features are extracted by using a recurrent neural network, the extracted features are classified by using a softmax classifier, and finally, the speaker is recognized by using a naive bayes method. Different from mute elimination of a traditional method, the method provided by the invention has the advantages that a mute section in the voice data is reserved, the features related to the context can be extracted on the basis of the recurrent neural network, the advanced features of the voice, such as the speaking mode and the rhythm of the speaker can be extracted aiming at the voice data, so that the feature information is complete, and the speaker can be well represented. Compared with an existing voiceprint recognition method based on Gauss, the method provided by the invention has the advantages that the requirements for voice data are relatively low, the accuracy is higher, the accuracy is still kept ata very high level even when big data is aimed, and the running speed is not reduced.

Description

technical field [0001] The invention provides an RNN-based voiceprint recognition method, which relates to the fields of deep learning, pattern recognition, and voice signal processing. Background technique [0002] With the rapid development of information technology, how to accurately authenticate a person's identity, protect personal privacy and ensure information security has become an urgent problem to be solved. Compared with traditional identity authentication methods, biometric identity authentication technology has the characteristics that it will not be lost, stolen or forgotten during use; it is not only fast, convenient, but also accurate and reliable. As one of the most popular biometric identification technologies at present, voiceprint recognition has unique advantages in remote authentication and other application fields, and has attracted more and more attention. WeChat has enabled the voice lock verification login method, the world's first voice lock Lenov...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/02G10L17/04G10L17/18G10L17/14G10L25/24G10L25/18G10L21/0208
CPCG10L17/02G10L17/04G10L17/14G10L17/18G10L21/0208G10L25/18G10L25/24
Inventor 冯毅夫王华锋徐雷杜俊逸付明霞马晨南齐一凡潘海侠
Owner 王华锋
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products