Method, device and equipment for synthesizing voice and music
A speech and music technology, applied in the field of speech synthesis, which can solve the problems of unsatisfactory speech flow quality and inability to meet the needs of speech and music synthesis.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0108] refer to figure 1 , which shows a flow chart of steps in Embodiment 1 of a method for synthesizing speech and music provided by an embodiment of the present invention, which may specifically include the following steps:
[0109] Step 101, obtain input voice data and background music data;
[0110] In the embodiment of the present invention, the voice data can be understood as the voice data formed by people who do not require regularity, and the voice and speed of speech can be erratic; the background music data can be understood as having a certain rhythm and regularity. Music data formed by a combination of tones. The "background music data" referred to in the embodiment of the present invention is essentially "music data". The word "background" is only used to emphasize its background as voice data synthesis, and does not mean that it has certain technical characteristics .
[0111] In a specific implementation, the voice data can be the voice data transmitted by ...
Embodiment 2
[0176] refer to Figure 4 , which shows a flow chart of steps in Embodiment 2 of a method for synthesizing speech and music provided by an embodiment of the present invention, which may specifically include the following steps:
[0177] Step 201, obtaining input voice data and background music data;
[0178] Step 202, identifying one or more single characters or words that make up the voice data from the voice data, and obtaining the pitch and duration of the one or more single characters or words;
[0179] Step 203, acquiring the pitch and duration of the background music data;
[0180] Step 204, according to the pitch and duration of the background music data, change the speed and / or transpose the pitch and duration of the one or more words or words;
[0181] Step 205, perform special effect processing on the speech data after the speed change and / or pitch shift processing, the special effect processing includes: echo special effect processing, and / or, T-Pain special effec...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com