
Voiceprint recognition method, device and apparatus for original voice, and storage medium

A voiceprint recognition technology for original voice, applied in voice analysis, instruments, etc., that addresses the problems of high information loss and high system complexity

Pending Publication Date: 2020-08-11
PING AN TECH (SHENZHEN) CO LTD
Cites: 0; Cited by: 11

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to solve the problems of high information loss and high system complexity in existing recognition models for a speaker's original speech data.



Examples


Embodiment Construction

[0069] Embodiments of the present invention provide a voiceprint recognition method, apparatus, device, and storage medium for original voice. A new loss function eliminates the noise information in the channel used for identifying voiceprint feature information in the original voice data, which reduces information loss. A preset convolution filter bank serves as the front-end preprocessing structure that turns the original voice data into voiceprint feature data, and a preset deep neural network performs pooling on the voiceprint feature data. The cosine similarity matrix loss function and the minimum mean square error matrix loss function then process the voiceprint feature vector to obtain target voiceprint data in the form of a similarity matrix or an embedding vector. With the speaker's original voice data at the input end and the target voiceprint data in the form of a similarity matrix or embedding vector at the output end, the structural form of the target voiceprint data in the form...
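To make the two matrix terms in the paragraph above concrete, here is a minimal sketch of a cosine similarity matrix over embedding vectors and a mean squared error against a target matrix. The excerpt does not give the patent's loss formulas, so this is not the patented loss function. The function names, the toy embeddings, and the target matrix (1 for same-speaker pairs, 0 otherwise) are all assumptions made for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # assumption: treat a zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def similarity_matrix(embeddings):
    """Pairwise cosine similarities between all embedding vectors."""
    return [[cosine_similarity(u, v) for v in embeddings] for u in embeddings]


def same_speaker_target(labels):
    """Hypothetical target: 1.0 where two utterances share a speaker label."""
    return [[1.0 if a == b else 0.0 for b in labels] for a in labels]


def matrix_mse(matrix, target):
    """Mean squared error between two matrices of the same shape."""
    count = sum(len(row) for row in matrix)
    squared = sum(
        (m - t) ** 2
        for m_row, t_row in zip(matrix, target)
        for m, t in zip(m_row, t_row)
    )
    return squared / count


# Toy data (assumed): utterances 0 and 2 come from speaker "a".
embeddings = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0]]
labels = ["a", "b", "a"]

sim = similarity_matrix(embeddings)
loss = matrix_mse(sim, same_speaker_target(labels))
```

In this toy case the two utterances from speaker "a" point in the same direction, so the similarity matrix matches the target and the error is zero. Real embeddings would come from the network described in the text.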



Abstract

The invention relates to the field of artificial intelligence and discloses a voiceprint recognition method for original voice, which reduces the information loss and system complexity of a recognition model for a speaker's original voice data. The method comprises: obtaining original voice data and segmenting it according to a preset time length to obtain segmented voice data; performing tail-biting convolution processing and discrete Fourier transform processing on the segmented voice data through a preset convolution filter bank to obtain voiceprint feature data; pooling the voiceprint feature data through a preset deep neural network to obtain target voiceprint features; performing embedding vector conversion on the target voiceprint features to obtain a corresponding voiceprint feature vector; and processing the voiceprint feature vector through a preset loss function to obtain target voiceprint data, the loss function comprising a cosine similarity matrix loss function and a minimum mean square error matrix loss function. The invention also relates to blockchain technology, and the voiceprint feature data is stored in a blockchain.
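The first step of the method, segmenting the original voice data by a preset time length, can be sketched as follows. This is a minimal illustration and not the patent's implementation. The function name `segment_audio`, the representation of audio as a flat list of samples, and the choice to drop a trailing partial segment are assumptions that the abstract does not specify.

```python
def segment_audio(samples, sample_rate, segment_seconds):
    """Split samples into consecutive, non-overlapping fixed-length segments.

    A trailing remainder shorter than one segment is dropped (an assumption;
    padding it would be an equally plausible choice).
    """
    segment_length = int(round(sample_rate * segment_seconds))
    if segment_length <= 0:
        raise ValueError("segment length must be at least one sample")
    last_start = len(samples) - segment_length
    return [
        samples[start:start + segment_length]
        for start in range(0, last_start + 1, segment_length)
    ]


# Toy input (assumed): 10 samples at 4 samples per second, 1-second segments.
segments = segment_audio(list(range(10)), sample_rate=4, segment_seconds=1.0)
```

With these toy values the call yields two full segments of four samples each and discards the last two samples.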

Description

Technical field

[0001] The invention relates to the field of speech signal processing, and in particular to a voiceprint recognition method, apparatus, device, and storage medium for original speech.

Background

[0002] At present, recognition models for a speaker's original speech data extract feature information from that data through manual feature engineering, generate vector data from the feature information, and perform channel noise fitting on the vector data to obtain fitted data. Speaker recognition is then performed on the fitted data to obtain the corresponding speaker information.

[0003] Since the obtained vector data cannot be used directly to identify differences in channel information for the same speaker or between different speakers, channel noise fitting must be performed on the vector data to obtain fitted data, and then the back-end reprocessi...


Application Information

IPC(8): G10L17/02, G10L17/06, G10L17/18
CPC: G10L17/02, G10L17/18, G10L17/06, Y02D30/70, G10L25/18, G10L25/21
Inventor: 郭跃超, 谯轶轩, 唐义君, 王俊, 高鹏, 谢国彤
Owner: PING AN TECH (SHENZHEN) CO LTD