Voice conversion method oriented to multi-time-scale prosodic features
A technology based on multi-time-scale prosodic features, applied in speech analysis, speech recognition, speech synthesis, etc., that can solve problems such as correlation
Embodiment Construction
[0051] Specific embodiments
[0052] The implementation of the technical solution is described in further detail below with reference to the accompanying drawings:
[0053] As shown in Figure 1, the multi-time-scale prosodic feature conversion method of the present invention, based on a double hidden Markov model, comprises the following steps:
[0054] In the first step, the speech signals of the input source speaker and target speaker are pre-processed by pre-emphasis, framing, and windowing. As shown in Figure 2, according to the grammatical rules of the speech signal and the auditory perception characteristics of the human ear, a sentence can be decomposed into several phrases, each of which completely and independently expresses a semantic meaning. A phrase can in turn be divided into several syllables, and each syllable is a basic unit of pronunciation. Different prosodic characteristics of speech signals are best represented at different time scales. Speech is divide...
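The pre-processing named in the first step (pre-emphasis, framing, and windowing) can be illustrated with a minimal sketch. The text does not specify any parameters, so everything concrete here is an assumption chosen for illustration only: the pre-emphasis coefficient of 0.97, the Hamming window, and the frame length of 400 samples with a hop of 160 samples (25 ms and 10 ms at an assumed 16 kHz sampling rate). The function names are hypothetical and are not taken from the patent.

```python
import math


def pre_emphasis(signal, alpha=0.97):
    """Apply y[n] = x[n] - alpha * x[n-1]; the first sample is passed through.

    alpha=0.97 is a commonly used value, assumed here (not from the source).
    """
    if not signal:
        return []
    return [signal[0]] + [
        signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))
    ]


def frame_signal(signal, frame_len, hop_len):
    """Split a signal into overlapping frames.

    Trailing samples that do not fill a complete frame are dropped.
    """
    if len(signal) < frame_len:
        return []
    count = 1 + (len(signal) - frame_len) // hop_len
    return [signal[i * hop_len:i * hop_len + frame_len] for i in range(count)]


def hamming(frame_len):
    """Symmetric Hamming window (an assumed choice of window)."""
    if frame_len == 1:
        return [1.0]
    return [
        0.54 - 0.46 * math.cos(2.0 * math.pi * n / (frame_len - 1))
        for n in range(frame_len)
    ]


def preprocess(signal, frame_len=400, hop_len=160, alpha=0.97):
    """Pre-emphasize, frame, and window a speech signal (list of floats)."""
    emphasized = pre_emphasis(signal, alpha)
    window = hamming(frame_len)
    return [
        [sample * weight for sample, weight in zip(frame, window)]
        for frame in frame_signal(emphasized, frame_len, hop_len)
    ]
```

In this sketch the same `preprocess` call would be applied to both the source speaker's and the target speaker's recordings, so that later feature extraction operates on frames produced in the same way for both.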