Voiceprint retrieval method based on deep Hash

A deep hashing technique for voiceprints, applied to fast voiceprint retrieval, addresses low accuracy arising from the training process, the inability to learn more discriminative hash codes, and low retrieval efficiency.

Active Publication Date: 2019-10-08
NANJING UNIV

AI Technical Summary

Problems solved by technology

[0005] Purpose of the invention: Current voiceprint retrieval methods are mainly based either on real-valued vectors or on hash coding. Voiceprint retrieval based on real-valued vectors suffers from low retrieval efficiency on large-scale data. Voiceprint retrieval based on hash codes adopts a two-stage training process: the i-vector is extracted first, and then a hash function is used to solve for the hash code of the i-vector. ...

Examples

Embodiment Construction

[0040] The present invention is further illustrated below with reference to specific embodiments. It should be understood that these embodiments only illustrate the invention and are not intended to limit its scope. After reading the present invention, modifications by those skilled in the art to its various equivalent forms all fall within the scope defined by the appended claims of the present application.

[0041] In the voiceprint retrieval method based on deep hashing, the training process of the deep voiceprint hash model is shown in Figure 1. First, the voices of labeled speakers are collected as a training set, and training labels are assigned according to speaker identity (step 10). Then the deep neural network model is built and its parameters are initialized (step 11): the network structure of the deep voiceprint hash model uses multiple convolutional layers as the backbone. Here, the mu...
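Step 10 (assigning training labels according to speaker identity) can be illustrated with a minimal Python sketch. The function name `assign_labels` and the use of consecutive integer labels are assumptions made for illustration; the source does not specify how labels are encoded.

```python
def assign_labels(speaker_ids):
    """Map each utterance's speaker identity to an integer training label.

    Hypothetical helper: labels are handed out in order of first appearance,
    so all utterances from the same speaker share one label.
    """
    label_of = {}   # speaker identity -> integer label
    labels = []     # one label per utterance, same order as speaker_ids
    for sid in speaker_ids:
        if sid not in label_of:
            label_of[sid] = len(label_of)
        labels.append(label_of[sid])
    return labels, label_of


# Example: three utterances, two distinct (made-up) speakers.
labels, label_of = assign_labels(["spk_a", "spk_b", "spk_a"])
```

Utterances from the same speaker receive the same label, which is the supervision signal the model is trained on.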

Abstract

The invention discloses a voiceprint retrieval method based on deep hashing, which achieves low storage cost and efficient retrieval in voiceprint retrieval tasks. The method comprises three steps: training a deep voiceprint hash model, constructing a hash code database, and retrieving a query voice in the database. First, an end-to-end deep neural network structure is constructed, and the deep neural network model is trained on voice data labeled with speaker identity to obtain a deep voiceprint hash function. The hash codes corresponding to the training set are then computed with the deep voiceprint hash function and used to construct a database. For newly input voice data, the deep voiceprint hash function computes the corresponding hash code, which is added to the database in real time. During retrieval, the deep voiceprint hash function computes the hash code of the given voice, and the retrieval result is obtained from the database by index lookup or by Hamming distance sorting.
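The retrieval step by Hamming distance sorting can be sketched as follows. This is a minimal illustration that assumes the hash codes are already available as arrays of 0/1 bits; the names `pack_codes` and `hamming_rank`, the bit packing, and the 8-bit codes in the example are assumptions for illustration and are not taken from the patent.

```python
import numpy as np

# Lookup table: number of set bits for every byte value 0..255.
_POPCOUNT = np.array([bin(i).count("1") for i in range(256)], dtype=np.uint8)


def pack_codes(bits):
    """Pack an (n, k) array of 0/1 bits into (n, ceil(k/8)) uint8 bytes."""
    return np.packbits(np.asarray(bits, dtype=np.uint8), axis=1)


def hamming_rank(query, database, top_k=3):
    """Return indices and Hamming distances of the top_k closest codes.

    query:    (b,) packed uint8 code of the query voice
    database: (n, b) packed uint8 codes of the stored voices
    """
    xor = np.bitwise_xor(database, query)            # differing bits, per byte
    dist = _POPCOUNT[xor].sum(axis=1, dtype=np.int64)
    order = np.argsort(dist, kind="stable")[:top_k]  # ascending distance
    return order, dist[order]


# Example with three stored 8-bit codes and one query code (made-up values).
db = pack_codes([[0, 0, 0, 0, 1, 1, 1, 1],
                 [0, 0, 0, 0, 1, 1, 1, 0],
                 [1, 1, 1, 1, 0, 0, 0, 0]])
q = pack_codes([[0, 0, 0, 0, 1, 1, 1, 1]])[0]
order, dist = hamming_rank(q, db, top_k=2)
```

Packing the bits keeps storage at one bit per code dimension, and the distance computation reduces to XOR plus a bit count. Adding a newly computed code to the database amounts to appending one row to `database`.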

Description

Technical field

[0001] The invention relates to a voiceprint retrieval method based on deep hashing, which enables fast voiceprint retrieval over a large-scale voice database with low storage overhead.

Background

[0002] Voiceprint retrieval takes a given voice and retrieves from the database one or more voices from the same speaker. With the popularity in recent years of devices with microphones, such as mobile phones and personal computers, and the rapid development of network media, large amounts of audio and video have emerged, and hundreds of hours of video are uploaded to the cloud every minute. Voice retrieval is being used more and more widely, for example to recommend similar voices or to detect infringement. In large-scale voiceprint authentication, where too many speakers lead to slow authentication, retrieval technology can also be used to speed up the authentic...

Application Information

IPC(8): G06F16/61, G06F16/63, G06F16/65, G06F16/683
CPC: G06F16/61, G06F16/63, G06F16/65, G06F16/683, Y02D10/00
Inventor: 李武军, 樊磊, 蒋庆远, 余亚奇
Owner NANJING UNIV