Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaker marking method and system based on density peak value clustering and variational Bayes

A variational Bayesian, density peaking technique, applied in the field of speaker tagging based on density peaking clustering and variational Bayesian, which can solve the problems of uncertainty, large deviation of speaker tagging performance, etc.

Active Publication Date: 2017-07-21
北京华控智加科技有限公司
View PDF5 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The invention solves the uncertainty of the initial value estimation of the number of speakers and the prior probability of the speaker at each moment in the prior art, and the speaker marking performance is easily affected by the initial value and produces large deviations; the enhancement The accuracy, stability and flexibility of speaker labeling

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker marking method and system based on density peak value clustering and variational Bayes
  • Speaker marking method and system based on density peak value clustering and variational Bayes
  • Speaker marking method and system based on density peak value clustering and variational Bayes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] A speaker labeling method and system based on density peak clustering and variational Bayesian proposed by the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0052] A speaker marking method based on density peak clustering and variational Bayesian proposed by the present invention, the flow chart is as follows figure 1 shown, including the following steps:

[0053] 1) Establish a training speech database, extract the Mel cepstrum feature of the speech signal in the training speech database, obtain the initial model of the general background through the k-means (kmeans) clustering algorithm, and use the Expectationmaximum (EM) iteration to obtain the general background Model; Baum-Welch statistics are extracted according to the established general background model and training voice data, and the subspace model is obtained by factor analysis;

[0054] Specifical...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speaker marking method and system based on density peak value clustering and variation Bayes, and belongs to the field of voiceprint identification and mode identification. The method includes: firstly establishing a training voice database, and obtaining a general background model and a subspace model; then obtaining an i-vector factor of each fragment of to-be-detected voice data through an i-vector factor extraction method; and obtaining the number of speakers of the to-be-detected voice data and the prior probabilities of the speakers at each moment by employing a density peak value clustering algorithm, performing iterative estimation on the posterior probability of each speaker corresponding to each fragment by employing variational Bayes, and obtaining a speaker marking result. According to the method and system, problems of uncertainty of estimation of initial values of the number of the speakers and the prior probability of each speaker at each moment and large deviation generated by easy influence of the initial values on the speaker marking performance in the prior art are solved, and the accuracy, the stability and the flexibility of speaker marking are improved.

Description

technical field [0001] The invention relates to the fields of voiceprint recognition and pattern recognition, in particular to a speaker marking method and system based on density peak clustering and variational Bayesian. Background technique [0002] The significance of speaker marking technology is that when it is applied to teleconferences and international conferences, it can be saved as meeting records. At the same time, accurate identification of speakers will naturally help subsequent speech processing and semantic recognition. In addition, in the field of monitoring, speaker marking can record the voice and language of the monitored object, and when applied to the field of public security or military, it contributes to the protection of public security and even national security. [0003] Speaker marking addresses the question of who spoke when. Speaker labeling starts with extracting Mel cepstral features for speech. The Mel cepstrum feature takes into account the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/14G10L17/02G10L17/04G10L17/14G10L25/24
CPCG10L15/142G10L15/144G10L17/02G10L17/04G10L17/14G10L25/24
Inventor 何亮徐灿田垚刘艺刘加
Owner 北京华控智加科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products