Text to speech (TTS) method and system

A speech synthesis and speech technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as word segmentation, prosody prediction deviation in speech unit selection, inability to generate absolutely correct front-end results, inability to synthesize human-synthesized speech, etc., to achieve Improving comprehension, compensating for lack of prediction accuracy, and reducing the size of the effect

Active Publication Date: 2013-10-23
SHANGHAI GUOKE ELECTRONICS
View PDF5 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, whether it is a traditional TTS system or a distributed TTS system, there is always a problem: with the current artificial intelligence technology, it is impossible to generate absolutely correct front-end results, and there may be deviations in word segmentation, prosody prediction, and phonetic unit selection. The front-end result plays a decisive role in the final synthesis result. A good front-end result greatly improves the intelligibility, naturalness and user acceptance, while a bad front-end result may make the synthesis result far from the text.
Although the common speech synthesis algorithms can synthesize high-quality and natural synthetic speech, they are all based on high-quality front-end analysis results. If there is no high-quality front-end text analysis results as a basis, any speech synthesis No algorithm can synthesize acceptable synthetic speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text to speech (TTS) method and system
  • Text to speech (TTS) method and system
  • Text to speech (TTS) method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0037] Such as Figure 3-4 As shown, the present invention provides a method for speech synthesis, comprising:

[0038] Step S1, the front-end performs text analysis and language analysis on the input text, and generates a front-end script containing corresponding speech units, specifically, as Figure 4 As shown, the front-end is set on the server, and the front-end can obtain the input text, and through a series of processing processes such as text analysis and language analysis, the input text is converted into a front-end script (intermediate data), and the output front-end script will be processed by the back-end It is used to synthesize speech, or to be verified and modified by the interactive verification terminal. Since t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a text to speech (TTS) method and system. The method comprises: performing text analysis and linguistic analysis on an inputted text so as to generate front-end scripts comprising corresponding speech units; obtaining, verifying and correcting the front-end scripts; and obtaining the corrected front-end scripts for synthesizing correction speech. By adopting the provided TTS method and system, the errors of the front-end scripts such as word-segmentation errors and polyphone phonetic notation errors can be corrected so that the synthesized speech is more understandable and more user friendly for users, a conventional TTS's shortcomings of insufficient prediction accuracy of rhythm can be overcome, and the synthesized speech is more natural and more expressive.

Description

technical field [0001] The invention belongs to the technical field of speech synthesis, in particular to a speech synthesis method and system. Background technique [0002] The traditional TTS (Text to Speech speech synthesis) system consists of two parts, the front end and the back end. The front end is mainly responsible for text preprocessing and speech unit generation, and the back end is mainly responsible for speech synthesis. Such as figure 1 As shown, both the front end and the back end of the traditional TTS system are set on the client side. The traditional TTS system has many processing links and high computational complexity, which puts forward higher requirements for the computing power and storage capacity of the computer, especially for the emerging mobile Terminal devices such as personal digital assistants, e-books, and mobile phones present great challenges. [0003] Therefore, a distributed TTS system came into being, such as figure 2 As shown, the f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/08G10L13/10
Inventor 王玉平
Owner SHANGHAI GUOKE ELECTRONICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products