Voiceprint recognition method, device and apparatus for original voice, and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voiceprint recognition, the original technology, applied in voice analysis, instruments, etc., can solve the problems of high information loss and high system complexity

Pending Publication Date: 2020-08-11

PING AN TECH (SHENZHEN) CO LTD

View PDF0 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The main purpose of the present invention is to solve the problems of high information loss and high system complexity existing in the existing speaker original speech data recognition model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0069] Embodiments of the present invention provide a method, device, device, and storage medium for voiceprint recognition of original voice. Through a new loss function, the noise information in the channel for identifying voiceprint feature information in original voice data is eliminated, and the information is reduced. Loss, by using the preset convolution filter bank as the front-end preprocessing structure of the original voice data to obtain the voiceprint feature data, and the preset deep neural network performs pooling processing on the voiceprint feature data, through the cosine similarity matrix Loss function and minimum mean square error matrix The loss function processes the voiceprint feature vector to obtain the target voiceprint data in the form of similarity matrix or embedding vector, the input end is the speaker's original voice data, and the output end is the similarity matrix or embedding vector The structural form of the target voiceprint data in the form...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the field of artificial intelligence, and discloses a voiceprint recognition method for original voice, which is used for reducing information loss and system complexity of anoriginal voice data recognition model of a speaker. The method comprises the steps of obtaining original voice data, and performing segmentation processing on the original voice data according to a preset time length to obtain segmented voice data; performing tail biting convolution processing and discrete Fourier transform processing on the segmented voice data through a preset convolution filter bank to obtain voiceprint feature data; pooling the voiceprint feature data through a preset deep neural network to obtain target voiceprint features; performing embedded vector conversion processing on the target voiceprint feature to obtain a corresponding voiceprint feature vector; and calculating the voiceprint feature vector through a preset loss function to obtain target voiceprint data, the loss function comprising a cosine similarity matrix loss function and a minimum mean square error matrix loss function. The invention also relates to a blockchain technology, and the voiceprint feature data is stored in the blockchain.

Description

technical field [0001] The invention relates to the field of speech signal processing, in particular to a voiceprint recognition method, device, equipment and storage medium for original speech. Background technique [0002] At present, the speaker's original speech data recognition model extracts the feature information from the speaker's original speech data through artificial feature engineering, generates vector data of the feature information, and performs channel noise fitting processing on the vector data to obtain the fitting processing data. Fitting and processing the data for speaker recognition and obtaining the corresponding speaker information. [0003] Since the obtained vector data cannot be directly used to identify the channel information difference between the same speaker or different speakers, it is necessary to perform channel noise fitting processing on the obtained vector data to obtain the fitting processing data, and then the The back-end reprocessi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/02G10L17/06G10L17/18

CPCG10L17/02G10L17/18G10L17/06Y02D30/70G10L25/18G10L25/21

Inventor郭跃超谯轶轩唐义君王俊高鹏谢国彤

OwnerPING AN TECH (SHENZHEN) CO LTD

Voiceprint recognition method, device and apparatus for original voice, and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology