Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A voiceprint recognition method based on 3D convolution neural network

A convolutional neural network and voiceprint recognition technology, applied in the field of voiceprint recognition based on 3D convolutional neural network, can solve the problems of voice data sensitivity, channel environment noise sensitivity, inability to meet, etc., to achieve accurate voiceprint recognition, physical Direct meaning, the effect of improving recognition performance

Inactive Publication Date: 2019-01-15
GUANGDONG UNIV OF TECH
View PDF4 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Common voiceprint recognition methods, such as the early voiceprint recognition method based on signal processing, use some technical methods in signal processing technology to calculate the parameters of voice data in signal science, and then perform template matching and statistical analysis of variance. The data is extremely sensitive, the accuracy rate is very low, and the recognition effect is very unsatisfactory
The recognition method based on the Gaussian mixture model can achieve good results and is simple and flexible, but it requires a large amount of voice data and is very sensitive to channel environmental noise, which cannot meet the requirements of real scenarios.
The existing deep learning neural network-based methods do not consider the context-dependent nature of the speech signal, and the extracted features cannot represent the speaker well, and do not fully utilize the advantages of deep learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A voiceprint recognition method based on 3D convolution neural network
  • A voiceprint recognition method based on 3D convolution neural network
  • A voiceprint recognition method based on 3D convolution neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0023] Such as Figure 1~4 Shown, a kind of voiceprint recognition method based on 3D convolutional neural network, comprises the following steps:

[0024] Step 1 is the preprocessing of the speech signal; more channel noise will be generated in the speech collection process, which will bring greater difficulties to the recognition task, so the spectral subtraction method is first used to denoise the input speech data, namely The noise spectrum estimate is subtracted from the noisy speech estimate to obtain the spectrum of the pure speech; the channel noise is eliminated here, and the channel noise is the noise caused by the recording device; while the channel noise is removed, the same All information about the speaker;

[0025] Wherein, the specific steps of the spec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voiceprint recognition method based on 3D convolution neural network, which comprises the following steps: step 1, preprocessing the speech signal. In the process of speech acquisition, there will be more channel noise, which will bring great difficulty to the recognition task, therefore, firstly, the input speech data is denoised by spectral subtraction, that is, noise spectrum estimation is subtracted from the noise speech estimation, so as to obtain the spectrum of pure speech. Here, the channel noise is eliminated, and the channel noise is the noise caused by therecording equipment; all the information about the speaker is completely preserved while the channel noise is removed. Compared with other methods, the spectral subtraction method introduces the leastconstraint conditions, the most direct physical meaning and the small calculation amount, so that the accuracy of recognition can be effectively improved.

Description

technical field [0001] The invention relates to the technical field of deep learning and voice signal processing, in particular to a voiceprint recognition method based on a 3D convolutional neural network. Background technique [0002] With the rapid development of the world's informatization process, information security issues are becoming more and more serious, and the impact is increasing. The issue of "personal privacy and confidentiality" urgently needs to be resolved; and how to accurately and safely determine a person's identity has aroused people's thinking. Traditional security authentication methods such as password or key authentication are usually easy to forget, lose, and steal. Common voiceprint recognition methods, such as the early voiceprint recognition method based on signal processing, use some technical methods in signal processing technology to calculate the parameters of voice data in signal science, and then perform template matching and statistical ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/18G10L17/02G10L21/0232G10L25/24
CPCG10L17/02G10L17/18G10L21/0232G10L25/24
Inventor 王艺航熊晓明刘祥李辉
Owner GUANGDONG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products