Chinese text-to-speech splicing synthesis system and method using prosody control

A text-to-speech technology applied to Chinese text-to-speech splicing systems. It addresses the difficulty of obtaining high-quality text-to-speech on portable devices and the resulting limits on application, and it achieves efficient syllable synthesis while saving storage space.

Publication Date: 2004-01-28 (status: Inactive)

CERENCE OPERATING CO


Problems solved by technology

It is difficult to obtain high-quality text-to-speech on portable devices.

This limits the application of text-to-speech conversion on such devices.

Embodiment Construction

[0016] Figure 1 shows an existing Chinese text-to-speech conversion system. The conversion system has three main parts: a text processor 100, a sound clip library 200 and a synthesis device 300. The text processor 100 standardizes and segments the input Chinese text into words, then assigns the corresponding phonetic symbols to its Chinese characters. The resulting phonetic symbol sequence is matched against the phonetic symbol sequences stored in the sound clip library 200, and each match is replaced with the corresponding word or phrase sound clip. Finally, the synthesis device 300 splices these sound clips in the order of the Chinese text and inserts appropriate pause information to obtain the required speech output. The sound clip library 200 stores a large number of Chinese text materials, as well as recordings of human pronunciation of these materials. The amount of these pronunciation materials u...
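The three-stage flow described in [0016] can be sketched in a few lines of Python. This is only an illustrative sketch, not the patent's implementation: the lexicon, the clip names, the `PAUSE` marker, splitting on the full-width comma as a stand-in for real standardization and word segmentation, and the greedy longest-match strategy are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical toy lexicon: Chinese character -> phonetic symbol (pinyin + tone).
LEXICON: Dict[str, str] = {"你": "ni3", "好": "hao3", "世": "shi4", "界": "jie4"}

# Hypothetical sound clip library (element 200): phonetic sequence -> clip name.
# Phrase-level clips sit alongside single-syllable clips.
CLIP_LIBRARY: Dict[Tuple[str, ...], str] = {
    ("ni3", "hao3"): "clip_nihao",
    ("ni3",): "clip_ni",
    ("hao3",): "clip_hao",
    ("shi4",): "clip_shi",
    ("jie4",): "clip_jie",
}

PAUSE = "<pause>"  # assumed placeholder for inserted pause information


def text_processor(text: str) -> List[List[str]]:
    """Element 100: split the text into chunks and assign phonetic symbols.

    Splitting on the full-width comma is a crude stand-in for real
    standardization and word segmentation.
    """
    return [[LEXICON[ch] for ch in chunk] for chunk in text.split("，") if chunk]


def match_clips(symbols: List[str]) -> List[str]:
    """Match the symbol sequence against the library, longest entry first."""
    clips: List[str] = []
    i = 0
    while i < len(symbols):
        for length in range(len(symbols) - i, 0, -1):
            key = tuple(symbols[i:i + length])
            if key in CLIP_LIBRARY:
                clips.append(CLIP_LIBRARY[key])
                i += length
                break
        else:
            raise KeyError(f"no clip for {symbols[i]}")
    return clips


def synthesize(text: str) -> List[str]:
    """Element 300: splice clips in text order, with a pause between chunks."""
    output: List[str] = []
    for n, chunk in enumerate(text_processor(text)):
        if n > 0:
            output.append(PAUSE)
        output.extend(match_clips(chunk))
    return output


print(synthesize("你好，世界"))
# ['clip_nihao', '<pause>', 'clip_shi', 'clip_jie']
```

The phrase-level clip is preferred when one exists, which hints at why such a library grows large: every additional word or phrase that should sound natural needs its own recording.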

Abstract

The system comprises a text processor, a prosody control device and a synthesizer. The text processor segments the input text into words and generates a phonetic symbol sequence. The prosody control device comprises at least a pronunciation label database and a differential prosody vector selection device. The database includes at least sound location indexes and differential prosody vectors. On receiving the phonetic symbol sequence, the selection device generates control data that includes a sound location index and a differential prosody vector. The synthesizer, which comprises a sound unit parameter database, responds to the control data from the prosody control device to generate synthesized speech. The invention provides synthesized speech of good quality and is suitable for small embedded devices.
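The data flow in the abstract (phonetic symbols in, control data out, synthesizer parameters last) might be sketched as follows. The abstract does not say what a differential prosody vector contains or how the synthesizer applies it, so this sketch assumes a vector of (pitch offset, duration offset) that is simply added to a unit's stored base parameters. All names, table contents and numbers are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class ControlData:
    sound_index: int                   # which stored sound unit to use
    diff_prosody: Tuple[float, float]  # assumed (pitch offset Hz, duration offset ms)


# Hypothetical pronunciation label database: phonetic symbol -> control data.
LABEL_DB: Dict[str, ControlData] = {
    "ni3": ControlData(0, (-10.0, 5.0)),
    "hao3": ControlData(1, (15.0, -20.0)),
}

# Hypothetical sound unit parameter database held by the synthesizer:
# index -> base (pitch Hz, duration ms).
UNIT_PARAMS: List[Tuple[float, float]] = [(200.0, 180.0), (220.0, 250.0)]


def prosody_control(symbols: List[str]) -> List[ControlData]:
    """Select an index and a differential prosody vector for each symbol."""
    return [LABEL_DB[s] for s in symbols]


def synthesizer(control: List[ControlData]) -> List[Tuple[float, float]]:
    """Apply each differential vector to the base parameters (assumed additive)."""
    out: List[Tuple[float, float]] = []
    for c in control:
        base_pitch, base_dur = UNIT_PARAMS[c.sound_index]
        d_pitch, d_dur = c.diff_prosody
        out.append((base_pitch + d_pitch, base_dur + d_dur))
    return out


result = synthesizer(prosody_control(["ni3", "hao3"]))
print(result)
# [(190.0, 185.0), (235.0, 230.0)]
```

Under this assumption the device stores one parameter set per sound unit plus small per-context vectors, rather than a separate recording for every context, which would fit the abstract's claim of suitability for small embedded devices.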

Description

Technical field

[0001] The present invention relates to a Chinese text-to-speech (TTS) splicing system, and in particular to a Chinese text-to-speech splicing synthesis system and method using prosody control.

Background

[0002] Reading large amounts of text data stored on electronic equipment such as computers, mobile phones or personal digital assistants (PDAs) easily causes visual fatigue. In some situations, such as in a moving car, reading data on an electronic screen is also inconvenient. It is therefore desirable to convert such text into speech and play it to the reader.

[0003] At present, high-quality Chinese text-to-speech synthesis technology is basically based on splicing the pronunciation waveforms corresponding to each character, word or phrase in the Chinese text. The required pronunciation waveforms are generally selected from a pronunciation waveform library with a large amount of data, and...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L13/04, G10L13/08

CPC: G10L13/10, G10L13/047

Inventor: 黄建成, 陈芳

Owner: CERENCE OPERATING CO
