A voiceprint recognition method based on 3D convolution neural network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A convolutional neural network and voiceprint recognition technology, applied in the field of voiceprint recognition based on 3D convolutional neural network, can solve the problems of voice data sensitivity, channel environment noise sensitivity, inability to meet, etc., to achieve accurate voiceprint recognition, physical Direct meaning, the effect of improving recognition performance

Inactive Publication Date: 2019-01-15

GUANGDONG UNIV OF TECH

View PDF4 Cites 20 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Common voiceprint recognition methods, such as the early voiceprint recognition method based on signal processing, use some technical methods in signal processing technology to calculate the parameters of voice data in signal science, and then perform template matching and statistical analysis of variance. The data is extremely sensitive, the accuracy rate is very low, and the recognition effect is very unsatisfactory

The recognition method based on the Gaussian mixture model can achieve good results and is simple and flexible, but it requires a large amount of voice data and is very sensitive to channel environmental noise, which cannot meet the requirements of real scenarios.

The existing deep learning neural network-based methods do not consider the context-dependent nature of the speech signal, and the extracted features cannot represent the speaker well, and do not fully utilize the advantages of deep learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0022] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0023] Such as Figure 1~4 Shown, a kind of voiceprint recognition method based on 3D convolutional neural network, comprises the following steps:

[0024] Step 1 is the preprocessing of the speech signal; more channel noise will be generated in the speech collection process, which will bring greater difficulties to the recognition task, so the spectral subtraction method is first used to denoise the input speech data, namely The noise spectrum estimate is subtracted from the noisy speech estimate to obtain the spectrum of the pure speech; the channel noise is eliminated here, and the channel noise is the noise caused by the recording device; while the channel noise is removed, the same All information about the speaker;

[0025] Wherein, the specific steps of the spec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voiceprint recognition method based on 3D convolution neural network, which comprises the following steps: step 1, preprocessing the speech signal. In the process of speech acquisition, there will be more channel noise, which will bring great difficulty to the recognition task, therefore, firstly, the input speech data is denoised by spectral subtraction, that is, noise spectrum estimation is subtracted from the noise speech estimation, so as to obtain the spectrum of pure speech. Here, the channel noise is eliminated, and the channel noise is the noise caused by therecording equipment; all the information about the speaker is completely preserved while the channel noise is removed. Compared with other methods, the spectral subtraction method introduces the leastconstraint conditions, the most direct physical meaning and the small calculation amount, so that the accuracy of recognition can be effectively improved.

Description

technical field [0001] The invention relates to the technical field of deep learning and voice signal processing, in particular to a voiceprint recognition method based on a 3D convolutional neural network. Background technique [0002] With the rapid development of the world's informatization process, information security issues are becoming more and more serious, and the impact is increasing. The issue of "personal privacy and confidentiality" urgently needs to be resolved; and how to accurately and safely determine a person's identity has aroused people's thinking. Traditional security authentication methods such as password or key authentication are usually easy to forget, lose, and steal. Common voiceprint recognition methods, such as the early voiceprint recognition method based on signal processing, use some technical methods in signal processing technology to calculate the parameters of voice data in signal science, and then perform template matching and statistical ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/18G10L17/02G10L21/0232G10L25/24

CPCG10L17/02G10L17/18G10L21/0232G10L25/24

Inventor 王艺航熊晓明刘祥李辉

Owner GUANGDONG UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A voiceprint recognition method based on 3D convolution neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology