Voice conversion method based on convolutive nonnegative matrix factorization
A voice conversion technique based on non-negative matrix factorization, applicable to speech analysis, speech recognition, speech synthesis, and related fields, that addresses problems such as limited applicability.
Detailed Description of the Embodiments
[0035] With reference to Figure 1, the voice conversion method of the present invention, based on convolutive non-negative matrix factorization, comprises the following steps:
[0036] Training phase: The transformation model is trained with the training data.
[0037] The first step is time alignment and parameter decomposition of training speech data:
[0038] (1) Time alignment of the speech data, as shown in Figure 2. First, the source speaker's speech and the target speaker's speech in the training data set are analyzed with the STRAIGHT model to obtain the pitch period at each sampling point of both signals, that is, the two pitch period envelopes:
[0039]
[0040]
[0041] where the two lengths are the numbers of sampling points contained in the source speaker's speech and in the target speaker's speech, respectively.
[0042] The pitch period here is expressed as a number of sampling points, with the fractional part rounded so that the value is an integer. Since the unvoiced s...
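As an illustration of expressing the pitch period as an integer number of sampling points, the following is a minimal sketch and not the patent's own procedure. It assumes the analysis stage supplies a fundamental-frequency value in Hz for each point and that unvoiced points are marked with a value of 0; the function name `pitch_period_in_samples`, the 16 kHz sampling rate, the round-half-up rule, and the choice to map unvoiced points to 0 are all hypothetical, since the source text is cut off before it describes how unvoiced segments are handled.

```python
def pitch_period_in_samples(f0_hz, sample_rate):
    """Convert an F0 contour (Hz) to pitch periods in whole samples.

    Assumption (not from the source): points with F0 <= 0 are unvoiced
    and are mapped to a period of 0.
    """
    periods = []
    for f0 in f0_hz:
        if f0 <= 0:
            # Unvoiced point: no pitch period is defined, use 0 as a marker.
            periods.append(0)
        else:
            # Period in samples = sample_rate / F0, rounded half up.
            periods.append(int(sample_rate / f0 + 0.5))
    return periods


# Hypothetical contour: unvoiced, 100 Hz, 110 Hz, 125 Hz, unvoiced.
f0_contour = [0.0, 100.0, 110.0, 125.0, 0.0]
periods = pitch_period_in_samples(f0_contour, 16000)
print(periods)  # [0, 160, 145, 128, 0]
```

At 16 kHz, a 110 Hz pitch corresponds to about 145.45 samples, which becomes 145 after rounding.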