Voiceprint recognition method, device and apparatus for original voice, and storage medium

A voiceprint recognition technology for original voice, applied in voice analysis, instruments, etc., that can solve the problems of high information loss and high system complexity.

Pending Publication Date: 2020-08-11
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to solve the problems of high information loss


Example Embodiment

[0069] The embodiments of the present invention provide a voiceprint recognition method, apparatus, device and storage medium for original speech. A new loss function eliminates the channel noise that interferes with recognizing voiceprint feature information in the original speech data, which reduces information loss. A preset convolution filter bank serves as the front-end preprocessing structure for the original speech data and produces voiceprint feature data. A preset deep neural network pools the voiceprint feature data. A cosine similarity matrix loss function and a minimum mean square error matrix loss function then process the voiceprint feature vector to obtain target voiceprint data in the form of a similarity matrix or an embedding vector. The input is the speaker's original speech data and the output is the similarity matrix or embedding vector. The structural form of the target voiceprint data in the form of the s...
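The two loss functions named in this paragraph can be illustrated with a minimal sketch. The text does not give their formulas, so everything below is an assumption made for illustration: the similarity matrix compares each embedding with per-speaker centroids, the cosine similarity matrix loss is taken to be a softmax cross-entropy over each row, and the minimum mean square error matrix loss is taken to be the mean squared difference from an ideal 0/1 matrix. All function names are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def centroids(embeddings, labels, num_speakers):
    """Mean embedding per speaker (assumed reference point for similarity)."""
    dims = len(embeddings[0])
    sums = [[0.0] * dims for _ in range(num_speakers)]
    counts = [0] * num_speakers
    for emb, spk in zip(embeddings, labels):
        counts[spk] += 1
        for d in range(dims):
            sums[spk][d] += emb[d]
    return [[x / max(c, 1) for x in row] for row, c in zip(sums, counts)]

def similarity_matrix(embeddings, labels, num_speakers):
    """Entry [i][j] is the cosine similarity of embedding i to speaker j's centroid."""
    cents = centroids(embeddings, labels, num_speakers)
    return [[cosine(emb, c) for c in cents] for emb in embeddings]

def cosine_matrix_loss(sim, labels):
    """Assumed form: softmax cross-entropy over each row of the similarity matrix."""
    total = 0.0
    for row, spk in zip(sim, labels):
        total += math.log(sum(math.exp(x) for x in row)) - row[spk]
    return total / len(sim)

def mse_matrix_loss(sim, labels):
    """Assumed form: mean squared error against an ideal 0/1 target matrix."""
    total = 0.0
    for row, spk in zip(sim, labels):
        for j, x in enumerate(row):
            target = 1.0 if j == spk else 0.0
            total += (x - target) ** 2
    return total / (len(sim) * len(sim[0]))

# Toy data: two speakers, two utterance embeddings each.
embeddings = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
labels = [0, 0, 1, 1]
sim = similarity_matrix(embeddings, labels, num_speakers=2)
print(cosine_matrix_loss(sim, labels), mse_matrix_loss(sim, labels))
```

How the two losses are weighted or combined is not stated in the text, so the sketch reports them separately.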



Abstract

The invention relates to the field of artificial intelligence and discloses a voiceprint recognition method for original voice, which reduces the information loss and system complexity of a recognition model for a speaker's original voice data. The method comprises: obtaining original voice data and segmenting it according to a preset time length to obtain segmented voice data; performing tail-biting convolution processing and discrete Fourier transform processing on the segmented voice data through a preset convolution filter bank to obtain voiceprint feature data; pooling the voiceprint feature data through a preset deep neural network to obtain a target voiceprint feature; performing embedding vector conversion on the target voiceprint feature to obtain a corresponding voiceprint feature vector; and calculating on the voiceprint feature vector through a preset loss function to obtain target voiceprint data, the loss function comprising a cosine similarity matrix loss function and a minimum mean square error matrix loss function. The invention also relates to blockchain technology, and the voiceprint feature data is stored in a blockchain.
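The front-end steps of the abstract (segmentation by a preset time length, then tail-biting convolution and a discrete Fourier transform through a filter bank) can be sketched as follows. This is a rough illustration under stated assumptions, not the patented implementation: "tail-biting convolution" is interpreted here as circular convolution in which indices wrap around the segment, a short trailing segment is simply dropped, and the filter kernels are hypothetical placeholders rather than the preset filter bank.

```python
import cmath
import math

def segment(samples, sample_rate, seconds):
    """Split samples into consecutive fixed-length segments; a short tail is dropped."""
    size = int(sample_rate * seconds)
    if size <= 0:
        raise ValueError("segment length must be positive")
    return [samples[i:i + size] for i in range(0, len(samples) - size + 1, size)]

def circular_convolve(signal, kernel):
    """Circular convolution: indices wrap around the segment (assumed reading of 'tail-biting')."""
    n = len(signal)
    return [sum(kernel[k] * signal[(i - k) % n] for k in range(len(kernel)))
            for i in range(n)]

def dft_magnitude(signal):
    """Magnitude of the discrete Fourier transform, computed directly in O(n^2)."""
    n = len(signal)
    return [abs(sum(signal[t] * cmath.exp(-2j * math.pi * f * t / n) for t in range(n)))
            for f in range(n)]

def extract_features(samples, sample_rate, seconds, filter_bank):
    """For each segment, apply every kernel in the bank and take the DFT magnitude."""
    return [[dft_magnitude(circular_convolve(seg, kernel)) for kernel in filter_bank]
            for seg in segment(samples, sample_rate, seconds)]

# Toy input: 10 samples at 4 Hz, 1-second segments, two hypothetical kernels.
samples = [float(x) for x in range(10)]
filter_bank = [[1.0], [0.5, 0.5]]
features = extract_features(samples, sample_rate=4, seconds=1, filter_bank=filter_bank)
print(len(features), len(features[0]), len(features[0][0]))
```

The direct DFT keeps the sketch dependency-free; a real system would use an FFT routine and learned or designed filters.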

Description

Technical field

[0001] The invention relates to the field of speech signal processing, and in particular to a voiceprint recognition method, apparatus, device and storage medium for original speech.

Background

[0002] At present, recognition models for a speaker's original speech data extract feature information from the original speech data through manual feature engineering, generate vector data from that feature information, and perform channel noise fitting on the vector data to obtain fitted data. The fitted data is then used for speaker recognition to obtain the corresponding speaker information.

[0003] Since the obtained vector data cannot be directly used to identify channel information differences for the same speaker or between different speakers, channel noise fitting must be performed on the obtained vector data to obtain fitted data, and then the back-end reprocessi...


Application Information

IPC(8): G10L17/02, G10L17/06, G10L17/18
CPC: G10L17/02, G10L17/18, G10L17/06, Y02D30/70, G10L25/18, G10L25/21
Inventor: 郭跃超, 谯轶轩, 唐义君, 王俊, 高鹏, 谢国彤
Owner: PING AN TECH (SHENZHEN) CO LTD