Voice synthesis method and device and electronic equipment
A technology of speech synthesis and speech library, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of strong machine sense of sound quality, poor stability outside the set, loss of details of sound quality and tone, etc., to improve the effect and improve the effect. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0057] figure 1 It is a flowchart of steps of a speech synthesis method provided by an embodiment of the present invention.
[0058] refer to figure 1 As shown, the speech synthesis method provided in this embodiment is applied to electronic devices such as electronic computers or speech synthesis equipment, and specifically includes the following steps:
[0059] S1. Perform text analysis on the input text.
[0060] When the user directly inputs or other electronic equipment inputs the corresponding text, text analysis is performed on the input text, and the target primitive sequence and corresponding context information are obtained therefrom. The target primitive sequence here includes multiple target primitives.
[0061] S2. Using the traditional model decision tree to determine the subcategory numbers to which the contextual information respectively belongs in the voice selection target model of the speech library.
[0062] The voice selection target model here include...
Embodiment 2
[0151] figure 2 It is a structural block diagram of a speech synthesis device provided by an embodiment of the present invention.
[0152] refer to figure 2 As shown, the speech synthesis device provided by this embodiment is applied to electronic equipment such as electronic computers or speech synthesis equipment, and specifically includes a text analysis module 10, a first calculation module 20, a distance calculation module 30, a grid construction module 40, a second The calculation module 50 , the third calculation module 60 , the fourth calculation module 70 , the path selection module 80 and the splicing output module 90 .
[0153] The text analysis module is used to perform text analysis on the input text.
[0154] When the user directly inputs or other electronic equipment inputs the corresponding text, text analysis is performed on the input text, and the target primitive sequence and corresponding context information are obtained therefrom. The target primitive...
Embodiment 3
[0198] This embodiment provides an electronic device, such as a speech synthesis device, an electronic computer, or a mobile terminal, which is provided with the speech synthesis device provided in the previous embodiment. The device is used for text analysis of the input text to obtain the target primitive sequence and corresponding context information; for the context information, the traditional model decision tree is used to determine the subcategories of the context information in the statistical model of the speech library number and the corresponding Gaussian distribution model to obtain the corresponding pre-selection results; use the pre-selection results to form a column for each target primitive in turn, and finally make the sequence of target primitives form a set of candidate grids; input the context information into the deep learning model, Get the acoustic parameter envelope, primitive duration, and boundary frame acoustic parameters of each target primitive in t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com