Voiceprint recognition method and device

A voiceprint recognition and voiceprint feature technology, applied in the field of identity authentication, can solve the problems of poor signal-to-noise ratio, long consumption time, complex noise types, etc., to achieve the effect of improving performance and resisting noise interference

Inactive Publication Date: 2014-08-06
TENCENT TECH (SHENZHEN) CO LTD
View PDF3 Cites 69 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the application scenario of speaker recognition technology, the collected voice data is unlikely to be clean, and the types of noise contained in it are complex, and the signal-to-noise ratio is very poor.
If the traditional underlying spectrum-based features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voiceprint recognition method and device
  • Voiceprint recognition method and device
  • Voiceprint recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0026] First of all, speaker identification is a multi-classification problem, while speaker verification is a binary classification problem, and a multi-classification problem can be converted into multiple binary classification problems. Therefore, the speaker confirmation problem can be used as an example to illustrate the relevant details of the embodiments of the present invention.

[0027] In fact, those skilled in the art can realize that the embodiments of the present invention are also applicable to the problem of speaker identification.

[0028] Text-independent speaker recognition does not need to store a specific text password, but directly uses the speaker's voice as a password, which can be widely used in security fields such as Internet user identit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method and system (800) for voiceprint recognition, include: establishing a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data; obtaining a plurality of high-level voiceprint features by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, and the tuning producing a second-level DNN model specifying the plurality of high-level voiceprint features; based on the second-level DNN model, registering a respective high-level voiceprint feature sequence for a user based on a registration speech sample received from the user; and performing speaker verification for the user based on the respective high-level voiceprint feature sequence registered for the user.

Description

technical field [0001] The embodiments of the present invention relate to the technical field of identity authentication, and more specifically, to a voiceprint recognition method and device. Background technique [0002] Voiceprint Recognition (VPR) is a kind of biometric technology, also known as speaker recognition (Speaker Recognition). Speaker identification includes two categories, namely speaker identification (Speaker Identification) and speaker confirmation (Speaker Verification). Speaker identification is used to judge which of several people said a certain speech, it is a "multiple choice" problem; and speaker confirmation is used to confirm whether a certain speech is spoken by a specified person, it is " One-to-one discrimination" problem. [0003] Voiceprint recognition includes two types: Text-Dependent and Text-Independent. The text-related voiceprint recognition system requires users to pronounce according to the specified content. Each person’s voiceprin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L17/20
CPCG10L17/20G10L17/18
Inventor 王尔玉卢鲤张翔刘海波李露饶丰陆读羚岳帅陈波
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products