Emotion speech synthesis method and device based on deep learning
A technology of speech synthesis and deep learning, applied in speech synthesis, speech analysis, instruments, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0078] Such as figure 1 As shown, the present embodiment provides a method for emotional speech synthesis based on deep learning, which belongs to the field of speech synthesis. Through this method, the synthesis of emotional speech can be realized without manual labeling of emotions, and the efficiency of synthetic speech emotion can be effectively improved. Naturalness.
[0079] combine figure 1 , 2 As shown, the method includes the following steps:
[0080] S1. Extract the text information to be processed and the preceding information of the text information to be processed.
[0081] Specifically, when the processing object is a text object, the previous information includes the previous text information;
[0082] When the processing object is a voice object or a video object, the previous information includes previous text information and previous voice information.
[0083] It should be noted that in this step, extracting text information from text objects, extractin...
Embodiment 2
[0127] In order to implement the method for emotional speech synthesis based on deep learning in the first embodiment above, this embodiment provides an apparatus 100 for emotional speech synthesis based on deep learning.
[0128] Figure 5 It is a schematic structural diagram of the deep learning-based emotional speech synthesis device 100, as Figure 5 As shown, the device 100 at least includes:
[0129] Extraction module 1: used to extract the text information to be processed and the previous text information of the text information to be processed, the previous text information includes the previous text information;
[0130] Emotional feature information generation module 2: used to generate emotional feature information through the pre-built first model by taking the text information to be processed and the preceding text information as input;
[0131] Emotional speech synthesis module 3: for synthesizing emotional speech through the pre-trained second model by taking ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com