Voice synthesizing apparatus and method based on rhythm reference

A speech synthesis and prosody technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as inability to accurately grasp text prediction, inability to control the connection of speech synthesis units, and inability to control the cadence of synthesized speech well. achieve the effect of improved naturalness

Inactive Publication Date: 2010-03-31
FUJITSU LTD
View PDF0 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Because there are still many unavoidable problems in text analysis and prosody prediction, the traditional speech synthesis system cannot accurately grasp the content of the text and the prediction of prosody parameters, and cannot well control the connection between various speech synthesis units. Well control the cadence of synthesized speech, which ultimately leads to the user's unsatisfactory speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice synthesizing apparatus and method based on rhythm reference
  • Voice synthesizing apparatus and method based on rhythm reference
  • Voice synthesizing apparatus and method based on rhythm reference

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] These and other aspects of the invention will become apparent with reference to the following description and drawings. In these descriptions and drawings, some specific embodiments of the present invention are specifically disclosed to represent some ways of implementing the principles of the present invention, but it should be understood that the scope of the present invention is not limited thereto. On the contrary, the invention includes all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0052] Features described and / or exemplified for one embodiment can be used in the same or similar manner in one or more other embodiments, and / or in combination with or instead of features of other embodiments .

[0053] It should be emphasized that the word "comprising" when used in this specification is used to refer to the presence of stated features, integers, steps or components, but does not exclude the presence of one or ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voice synthesizing apparatus and a method based on rhythm reference. The voice synthesizing apparatus comprises a rhythm parameter acquiring part and a sound composing part, wherein the rhythm parameter acquiring part acquires natural rhythm parameters or proximate natural rhythm parameters by analyzing voice record files acquired by ways including the way that natural person reads texts to be synthesized or analyzing rhythm parameter label files acquired by labeling rhythm parameter for texts to be synthesized with a predetermined label standard; and the sound composing part uses the natural rhythm parameters or proximate natural rhythm parameters as references, selects corresponding voice synthesizing units from a pre-record voice library aiming at the texts to be synthesized and splices to synthesize the voice synthesizing units to generate synthesized voice files corresponding to the texts to be synthesized. Highly natural, motional synthesized voices having cadence extremely close to natural voices are generated according to requirements of users according to the synthesizing apparatus and method.

Description

technical field [0001] The present invention relates to a device and method for speech synthesis based on prosodic reference. More specifically, the present invention relates to using the prosodic features obtained from natural speech or prosodic feature annotation files based on specific standards as a reference to synthesize speech with A device and method for synthesizing speech with high naturalness. Background technique [0002] Speech synthesis (Text-To-Speech, TTS for short) is a technology for converting text to speech, specifically, a technology for converting arbitrary text information into standard, fluent speech. Speech synthesis involves many cutting-edge high-tech technologies such as natural language processing, prosody, speech signal processing, and sound perception. It spans multiple disciplines such as acoustics, linguistics, and digital signal processing. It is a cutting-edge technology in the field of Chinese information processing. . [0003] Speech sy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L13/08G10L13/10
Inventor 郭庆陆应亮王彬
Owner FUJITSU LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products