Method for synthesizing emotional speech by utilizing transfer learning under low resources
A technology of speech synthesis and transfer learning, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of high acquisition cost and unconditional access to data sets, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0045] The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
[0046] This embodiment provides a method for emotional speech synthesis using transfer learning under low resources. In the actual operation of this embodiment, two data sets: EMOV-DB and LJSpeech-1.1 are used, wherein the EMOV-DB data set is low The emotional speech synthesis dataset of the resource, the text in the dataset is based on the CMU Arctic database. The dataset includes recordings of four speakers - two men and two women. Emotion types include neutral, sleepy, angry, disgusted, and entertaining. The LJSpeech-1.1 dataset is a single-person emotion-neutral speech synthesis dataset containing 13,100 short audio clips from a single speaker from 7 non-fiction books. Tran...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com