Text-independent speaker identifying device based on line spectrum frequency difference value

A technology of speaker identification and line spectrum frequency, applied in speech analysis, instruments, etc., can solve the problems of inconvenient use, easy to be recorded and falsely identified by the recognition system, and difficult to build a speaker model.

Inactive Publication Date: 2014-06-18
BEIJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Text-independent speaker recognition technology does not specify the content of the speech, whether it is training or recognition. The recognition object is a free speech signal. It is necessary to find the characteristics and methods that can represent the information of the speaker in the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text-independent speaker identifying device based on line spectrum frequency difference value
  • Text-independent speaker identifying device based on line spectrum frequency difference value
  • Text-independent speaker identifying device based on line spectrum frequency difference value

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0034] figure 1 It is a flowchart of the present invention, wherein the dotted line represents the direction of the training part of the process, and the solid line represents the direction of the identification part of the process, including the following steps:

[0035] The first step: feature extraction step, feature extraction of the speaker's voice sequence to be trained

[0036] Step S1: converting the line spectrum frequency parameter into a line spectrum frequency parameter difference;

[0037] Step S2: generating a line spectrum frequency feature supervector;

[0038] Step 2: Train the model

[0039] Step S3: use the super-Dirichlet mixture model to simulate the distribution of feature supervectors, and solve the parameters in the model;

[0040] Step Three: Identification Process

[0041] Repeat step S1 and step S2 in the first ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a text-independent speaker identifying device based on a line spectrum frequency difference value. A method comprises the following steps: the feature extraction step, wherein a line spectrum frequency parameter is converted into the line spectrum frequency parameter difference value through linear conversion, a generation line spectrum frequency characteristic supervector is formed by combining a current frame, a front adjacent frame and a rear adjacent frame; the model training step, wherein distribution of the characteristic supervector is simulated by utilizing a super Dirichlet mixed model, and a parameter in the model is solved; the identifying step, wherein regarding a voice sequence of an identified person, characteristics are abstracted according the step one, then the model obtained in the step two is input, a likelihood value of each probabilistic model is calculated, the largest likelihood value is obtained, and a number of the speaker is confirmed. By means of the method, the text-independent speaker identification rate can be improved, and high practical value is achieved.

Description

technical field [0001] The invention emphatically describes a text-independent speaker recognition system based on linear transformation line spectrum frequency parameters and super Dirichlet mixed model. Background technique [0002] With the development of computer technology, the use of human biometrics (such as fingerprints, voiceprints, and faces) for identification or confirmation has very important research and application values. Speaker recognition is to automatically confirm whether the speaker is in the recorded set of speakers according to the voice parameters reflecting the characteristics of the speaker's physiology and behavior in the voice waveform, and further confirm the identity of the speaker. Speaker recognition includes two parts: speaker identification and speaker confirmation. The speaker identification system usually includes three parts: extracting features that can represent the speaker, training an independent model for each speaker that conforms...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L17/02G10L17/04
Inventor 马占宇齐峰张洪刚
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products