Voice synthesizing apparatus and method based on rhythm reference

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A speech synthesis and prosody technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as inability to accurately grasp text prediction, inability to control the connection of speech synthesis units, and inability to control the cadence of synthesized speech well. achieve the effect of improved naturalness

Inactive Publication Date: 2010-03-31

FUJITSU LTD

View PDF0 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Because there are still many unavoidable problems in text analysis and prosody prediction, the traditional speech synthesis system cannot accurately grasp the content of the text and the prediction of prosody parameters, and cannot well control the connection between various speech synthesis units. Well control the cadence of synthesized speech, which ultimately leads to the user's unsatisfactory speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0051] These and other aspects of the invention will become apparent with reference to the following description and drawings. In these descriptions and drawings, some specific embodiments of the present invention are specifically disclosed to represent some ways of implementing the principles of the present invention, but it should be understood that the scope of the present invention is not limited thereto. On the contrary, the invention includes all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0052] Features described and / or exemplified for one embodiment can be used in the same or similar manner in one or more other embodiments, and / or in combination with or instead of features of other embodiments .

[0053] It should be emphasized that the word "comprising" when used in this specification is used to refer to the presence of stated features, integers, steps or components, but does not exclude the presence of one or ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice synthesizing apparatus and a method based on rhythm reference. The voice synthesizing apparatus comprises a rhythm parameter acquiring part and a sound composing part, wherein the rhythm parameter acquiring part acquires natural rhythm parameters or proximate natural rhythm parameters by analyzing voice record files acquired by ways including the way that natural person reads texts to be synthesized or analyzing rhythm parameter label files acquired by labeling rhythm parameter for texts to be synthesized with a predetermined label standard; and the sound composing part uses the natural rhythm parameters or proximate natural rhythm parameters as references, selects corresponding voice synthesizing units from a pre-record voice library aiming at the texts to be synthesized and splices to synthesize the voice synthesizing units to generate synthesized voice files corresponding to the texts to be synthesized. Highly natural, motional synthesized voices having cadence extremely close to natural voices are generated according to requirements of users according to the synthesizing apparatus and method.

Description

technical field [0001] The present invention relates to a device and method for speech synthesis based on prosodic reference. More specifically, the present invention relates to using the prosodic features obtained from natural speech or prosodic feature annotation files based on specific standards as a reference to synthesize speech with A device and method for synthesizing speech with high naturalness. Background technique [0002] Speech synthesis (Text-To-Speech, TTS for short) is a technology for converting text to speech, specifically, a technology for converting arbitrary text information into standard, fluent speech. Speech synthesis involves many cutting-edge high-tech technologies such as natural language processing, prosody, speech signal processing, and sound perception. It spans multiple disciplines such as acoustics, linguistics, and digital signal processing. It is a cutting-edge technology in the field of Chinese information processing. . [0003] Speech sy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L13/02G10L13/08G10L13/10

Inventor郭庆陆应亮王彬

OwnerFUJITSU LTD

Voice synthesizing apparatus and method based on rhythm reference

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology