Chinese prosodic boundary prediction method based on graph-to-sequence
Prosodic boundary prediction technology, applied in speech synthesis, speech analysis, instruments, etc., that addresses the problem of temporal information and spatial information not being combined.
Embodiment Construction
[0024] The present invention is described and verified in further detail below with reference to the experimental process and experimental results.
[0025] Building on BiLSTM-CRF, the basic framework currently in general use for sequence prediction, the present invention proposes a representation of the spatial information of text from the perspective of text analysis and, on this basis, combines the time-series information and the spatial information of the text for the first time to improve prosodic boundary prediction in speech synthesis. The technical scheme consists of the following three parts:
[0026] (1) Basic structure of sequence prediction
[0027] At present, the most common method in the industry for the prosody prediction module of speech synthesis is BiLSTM-CRF. The BiLSTM takes the text embedding vectors as input and outputs features extracted in the time domain. The output of the BiLSTM is in turn the input of the CRF. According to the c...
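To make the BiLSTM-to-CRF hand-off concrete, the following is a minimal sketch of how a linear-chain CRF layer is commonly decoded once per-token scores are available. It is not taken from the patent text, which is truncated at this point. The function name `viterbi_decode`, the two-label scheme (0 = no boundary, 1 = prosodic boundary), and all score values are illustrative assumptions; in a real system the `emissions` matrix would come from the BiLSTM output and the `transitions` matrix would be learned.

```python
from typing import List, Sequence


def viterbi_decode(emissions: Sequence[Sequence[float]],
                   transitions: Sequence[Sequence[float]]) -> List[int]:
    """Return the highest-scoring label sequence for a linear-chain CRF.

    emissions[t][j]   : score of label j at position t (stand-in for BiLSTM output)
    transitions[i][j] : score of moving from label i to label j
    """
    if not emissions:
        return []
    num_labels = len(transitions)
    # score[j] holds the best score of any path ending in label j so far.
    score = list(emissions[0])
    backpointers = []
    for emit in emissions[1:]:
        new_score = []
        pointers = []
        for j in range(num_labels):
            # Best previous label for reaching label j at this position.
            best_prev = max(range(num_labels),
                            key=lambda i: score[i] + transitions[i][j])
            new_score.append(score[best_prev] + transitions[best_prev][j] + emit[j])
            pointers.append(best_prev)
        score = new_score
        backpointers.append(pointers)
    # Trace the best path back from the best final label.
    last = max(range(num_labels), key=lambda j: score[j])
    path = [last]
    for pointers in reversed(backpointers):
        last = pointers[last]
        path.append(last)
    path.reverse()
    return path


# Hypothetical scores for a three-token sentence; labels: 0 = no boundary, 1 = boundary.
emissions = [[1.0, 0.0], [0.2, 0.8], [0.9, 0.1]]
# Hypothetical transition scores that penalize two boundaries in a row.
transitions = [[0.0, 0.0], [0.0, -5.0]]
path = viterbi_decode(emissions, transitions)
print(path)  # [0, 1, 0]
```

The transition scores are what distinguish a CRF from labeling each token independently: with the penalty above, two adjacent tokens that both favor label 1 are not both tagged as boundaries.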