Multi-background modeling method for speaker recognition
A speaker recognition and background model technology, applied in the field of speech recognition, can solve the problem of not necessarily accurate division, and achieve the effect of overcoming inaccurate data division, overcoming the lack of fineness, and improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0037] In the GMM-UBM system, the establishment of the UBM model is a crucial step. However, there is still no complete set of theoretical guidance on how to select UBM training data. Researchers can only select according to the final experimental results based on experience. Generally speaking, there are two types of gender-independent UBM and gender-related UBM, among which the performance of gender-related UBM is more superior. The invention promotes the gender-related UBM, divides the training data according to the channel length, and obtains a plurality of background models, and can be divided into three modules for specific implementation.
[0038] Module 1: Multi-background model training module
[0039] Firstly, it is necessary to obtain the bending coefficient of the channel length of the training UBM data. In this step, the maximum likelihood criterion is used to obtain it. First use all the training data to train a "neutral" GMM model with the Baum-Welch algorithm,...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com