Robust speaker distinguishing method based on multifactor frequency displacement invariant feature
A technique based on multi-factor, frequency displacement invariant features, applied in voice analysis, instruments and related fields. It addresses the degraded performance and poor robustness of speaker discrimination, improving accuracy and reducing interference.
Embodiment Construction
[0059] The present invention will be further described below in conjunction with drawings and embodiments.
[0060] As shown in Figure 2, the multi-factor frequency displacement invariant feature extraction method for speech of the present invention specifically comprises the following steps:
[0061] (1) Preprocess the speech data x(t) of 51 children in the TIDIGITS database. The sampling rate is 8 kHz. A Hamming window is used for windowing, with a window length of 23 ms and a window shift of 10 ms. Calculate the energy spectrum S(f,t) of the signal by Fourier transform;
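The preprocessing in step (1) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `energy_spectrum`, the use of NumPy, the rounding of the window to whole samples, and the synthetic 1 kHz test tone (standing in for a database recording) are all assumptions made here.

```python
import numpy as np

def energy_spectrum(x, fs=8000, win_ms=23.0, hop_ms=10.0):
    """Return S[f, t], the short-time energy spectrum of a 1-D signal x.

    Hypothetical helper: the source only states the window type,
    window length, window shift and sampling rate.
    """
    win_len = int(round(fs * win_ms / 1000.0))  # 23 ms -> 184 samples at 8 kHz
    hop_len = int(round(fs * hop_ms / 1000.0))  # 10 ms -> 80 samples at 8 kHz
    window = np.hamming(win_len)
    n_frames = 1 + (len(x) - win_len) // hop_len
    # Build an index matrix so that each row selects one overlapping frame.
    idx = np.arange(win_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    frames = x[idx] * window
    # One-sided FFT per frame; the squared magnitude is the energy spectrum.
    spec = np.fft.rfft(frames, axis=1)
    return (np.abs(spec) ** 2).T  # shape: (frequency bins, frames)

# Synthetic stand-in for a speech recording: one second of a 1 kHz tone.
fs = 8000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 1000.0 * t)
S = energy_spectrum(x, fs)
```

With these settings each frame holds 184 samples, so S has 93 frequency bins, and consecutive frames overlap by 104 samples.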
[0062] (2) Filter the energy spectrum S(f, t) with a two-dimensional complex wavelet transform at 4 different scales and 4 different phases to obtain the tensor multi-factor representation of the speech signal. This representation is a 4th-order tensor whose orders correspond to frequency, time, scale and phase. A bank of 36 Mel-scale filters is then used for frequency-order filtering o...
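The frequency-order Mel filtering mentioned in step (2) can be sketched as follows. This is a hedged illustration under several assumptions that do not come from the source: the filters are taken to be triangular, the Mel scale uses the common 2595·log10(1 + f/700) formula, the helper names are hypothetical, and a random array with 93 frequency bins and 98 frames stands in for the wavelet-filtered tensor, whose actual size is not given in the text.

```python
import numpy as np

def hz_to_mel(f):
    # Common Mel-scale formula (an assumption; the source does not specify one).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filter_bank(n_filters, n_bins, fs):
    """Triangular filters spaced evenly on the Mel scale.

    Returns an array of shape (n_filters, n_bins).
    """
    freqs = np.linspace(0.0, fs / 2.0, n_bins)  # centre frequency of each bin
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(fs / 2.0), n_filters + 2))
    bank = np.zeros((n_filters, n_bins))
    for i in range(n_filters):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        # The triangle is the lower of the two slopes, clipped at zero.
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

# Placeholder 4th-order tensor: (frequency, time, scale, phase).
rng = np.random.default_rng(0)
tensor = rng.random((93, 98, 4, 4))

bank = mel_filter_bank(36, tensor.shape[0], fs=8000)
# Contract the filter bank with the frequency order only; the time,
# scale and phase orders are left untouched.
mel_tensor = np.tensordot(bank, tensor, axes=([1], [0]))
```

The contraction replaces the 93 linear frequency bins with 36 Mel channels, so `mel_tensor` keeps the same 4th-order layout with a shorter frequency order.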