Method and device for forecasting duration of speech synthesis unit

A technology of speech synthesis and duration, applied in the field of information processing, can solve problems such as single
CN102231276BActive Publication Date: 2013-03-20BEIJING SINOVOICE TECH CO LTD

Patent Information

Authority / Receiving Office
CN Β· China
Patent Type
Patents(China)
Current Assignee / Owner
BEIJING SINOVOICE TECH CO LTD
Publication Date
2013-03-20

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a method and device for forecasting duration of a speech synthesis unit. The method comprises the steps of: aiming at context environmental parameters, carrying out initial forecasting on the duration of the speech synthesis unit by utilizing a stepwise linear regression duration forecasting model so as to obtain an initial duration forecasting result; and distributing the initial duration forecasting result by utilizing a decision tree-Gaussian mixture model so as to obtain a distributed duration forecasting result. According to the method and device which are providedby the invention, the accuracy of the duration forecasting result can be increased to ensure that a speech synthesized in a speech synthesis system has a real sense of rhythm.
Need to check novelty before this filing date? Find Prior Art

Description

Technical field

[0001] The present invention relates to the technical field of information processing, in particular to a method and device for training a duration prediction model of stepwise linear regression, and a method and device for predicting the duration of a speech synthesis unit. Background technique

[0002] In a speech synthesis system (Text-to-Speech, TTS), the prediction and generation of the duration of a speech synthesis unit is an indispensable step, which plays a vital role in the prosody hearing of synthesized speech.

[0003] According to the theories of phonetics and phonology, the duration and other characteristics of the speech synthesis unit are determined by its context. The prediction of speech duration is essentially a mapping from the value space of the context environment parameter to the duration value space. For the analysis and modeling method of this kind of mapping relationship, the existing time length prediction method usually adopts the decisi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More