Voice and face composite identity authentication method based on end-to-end deep neural network

A deep neural network and identity authentication technology, which is applied in the field of voice and face composite identity authentication based on end-to-end deep neural network, can solve problems such as the impact of voiceprint recognition, and achieve the effect of improving recognition accuracy

Inactive Publication Date: 2018-08-14
成都数智凌云科技有限公司
View PDF5 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in actual engineering applications, factors such as lighting changes in the application environment and face posture have a greater impact on face recognition, and the state of the speaker's vocal tract and environmental noise also have a greater impact on voiceprint recognition.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice and face composite identity authentication method based on end-to-end deep neural network
  • Voice and face composite identity authentication method based on end-to-end deep neural network
  • Voice and face composite identity authentication method based on end-to-end deep neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0038] like figure 1 Shown is a schematic flow chart of the voice and face composite identity authentication method based on the end-to-end deep neural network of the present invention. A voice and face composite identity authentication method based on an end-to-end deep neural network, comprising the following steps:

[0039] A. Collect preset text-related voice signals, and simultaneously collect facial video signals of people to be identified;

[0040] B, extracting the speech voiceprint feature of the text-related speech signal in step A;

[0041] C, extract the multi-frame human face featur...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice and face composite identity authentication method based on an end-to-end deep neural network. The method comprises the steps that a text-related voice signal and a facial video signal are collected, voice vocal print features and multi-frame face features are extracted, the voice vocal print features and the multi-frame face features are connected to obtain an identity feature vector, dimension reduction processing is carried out on the identity feature vector, and personal identity verification is carried out by adopting a Triplets Loss method. The voice and face composite identity authentication method based on the end-to-end deep neural network has the advantages that identity authentication is carried out by combining the voice and face features, the deficiency caused by a single feature is made up for, the voice and face composite identity authentication method can be applied to occasions such as entrance guard and check on work attendance, man-machine interaction and the like, and the identification accuracy is effectively improved.

Description

technical field [0001] The invention belongs to the technical field of identity recognition, and in particular relates to an end-to-end deep neural network-based voice and face composite identity authentication method. Background technique [0002] With the development of artificial intelligence technology, identity verification methods such as face recognition and voiceprint recognition have been widely used in the field of intelligent security. Among them, FaceNet, a face recognition model based on convolutional neural network developed by Google, directly learns an encoding method from image to Euclidean space end-to-end, and then performs face recognition, face verification and face clustering based on this encoding. Wait. FaceNet has an accuracy rate of 0.9963 on the LFW dataset and an accuracy rate of 0.9512 on the YouTube Faces DB dataset. The Deep Speaker developed by Baidu is composed of deep neural network layers. It uses MFCC, time pooling based on cosine simila...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G10L17/18G10L25/18G10L25/24
CPCG10L17/18G10L25/18G10L25/24G06V40/168G06F18/213
Inventor 胡德昆易发胜崔国栋
Owner 成都数智凌云科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products