Method for optimizing local synthesis based on distributed natural rhythm

A synthesis method using distributed technology, applied in speech synthesis, speech analysis, instruments, etc. It addresses problems such as unstable network environments, slow synthesis response, and limited storage space on hardware devices, and improves the response speed of synthesis.

Active Publication Date: 2013-05-01
IFLYTEK CO LTD
6 Cites, 9 Cited by

AI Technical Summary

Problems solved by technology

[0002] As information technology develops, speech synthesis is used in more and more applications. However, synthesis quality is constrained by hardware and deployment conditions such as limited device storage, unstable network environments, and user traffic limits, so the better synthesis quality that is achievable often does not reach the application product.
Enlarging the training library improves the synthesized speech but increases the storage required, which makes the system difficult to fit on a terminal device. If synthesis is instead performed through a network (cloud) call, the response is slow when the network environment is poor, and transmitting voice data consumes a relatively large amount of user traffic.



Examples


Detailed Description of the Embodiments

[0042] The method proposed by the present invention for optimizing local synthesis effects based on distributed natural rhythm is explained in detail below with reference to the accompanying drawings.

[0043] As shown in Figure 1, the present invention comprises information extraction, network transmission, and a local synthesis process, specifically as follows:

[0044] Step 1: Collect commonly used, fixed texts and record them;

[0045] Step 2: Based on the recordings and the text, manually annotate the text to obtain the correct prosody information, and store it as a text file;

[0046] Step 3: Use an offline fundamental frequency and duration tool to generate parameters from the voice data, obtaining the phoneme state durations and the average fundamental frequency of the corresponding speech (the average fundamental frequency includes static parameters and first-order dynamic parameters), and store them as binary data files.

[0047] The duration information is segmented...
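
As a minimal sketch of the kind of binary file Step 3 describes, the following Python packs per-phoneme state durations together with static and first-order dynamic average fundamental frequency values. The text shown here does not specify the actual layout, so the five states per phoneme, the field widths, the little-endian byte order, and the names `PhonemeProsody`, `encode`, and `decode` are all illustrative assumptions.

```python
import struct
from typing import List, NamedTuple, Tuple

# Assumption: 5 states per phoneme; the source does not state the count.
STATES = 5
# Assumed record layout: durations as uint16 frame counts, then static
# and first-order dynamic average F0 values as float32, little-endian.
RECORD = struct.Struct(f"<{STATES}H{STATES}f{STATES}f")
HEADER = struct.Struct("<I")  # assumed header: number of phoneme records


class PhonemeProsody(NamedTuple):
    durations: Tuple[int, ...]    # frames per state
    f0_static: Tuple[float, ...]  # average F0 per state
    f0_delta: Tuple[float, ...]   # first-order dynamic F0 per state


def encode(records: List[PhonemeProsody]) -> bytes:
    """Serialize the records into one compact binary blob."""
    out = bytearray(HEADER.pack(len(records)))
    for r in records:
        out += RECORD.pack(*r.durations, *r.f0_static, *r.f0_delta)
    return bytes(out)


def decode(blob: bytes) -> List[PhonemeProsody]:
    """Recover the records written by encode()."""
    (count,) = HEADER.unpack_from(blob, 0)
    offset = HEADER.size
    records = []
    for _ in range(count):
        fields = RECORD.unpack_from(blob, offset)
        records.append(PhonemeProsody(
            fields[:STATES],
            fields[STATES:2 * STATES],
            fields[2 * STATES:],
        ))
        offset += RECORD.size
    return records
```

Under these assumptions each phoneme occupies 50 bytes, so a terminal device could download the blob and hand the decoded durations and fundamental frequency values to its local back end.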


Abstract

A method for optimizing local synthesis effect based on distributed natural rhythm (prosody), used to transmit natural speech rhythm information at a low bit rate and synthesize locally. Recorded data or a server-class synthesis system is used to generate front-end annotation information and speech rhythm information for the text to be synthesized; this information is then downloaded to the local device over the network and combined with the local back-end system for synthesis. Because better front-end information and back-end rhythm parameters are used, the rhythm of local synthesis improves, and with it the local synthesis effect. Moreover, the fundamental frequency and duration occupy only a small amount of data, so the method responds faster and uses less traffic than the conventional network synthesis mode.
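
To make the traffic argument concrete, here is a rough back-of-the-envelope comparison in Python. Every number in it is an assumption for illustration, not a figure from the patent: 16 kHz 16-bit mono PCM for the audio, about 10 phonemes per second of speech, 5 states per phoneme, and per state a 2-byte duration plus two 4-byte fundamental frequency values (static and first-order dynamic).

```python
def pcm_bytes(seconds: float, sample_rate: int = 16000,
              bytes_per_sample: int = 2) -> int:
    """Size of uncompressed mono PCM audio (assumed format)."""
    return int(seconds * sample_rate * bytes_per_sample)


def prosody_bytes(num_phonemes: int, states: int = 5,
                  bytes_per_duration: int = 2,
                  bytes_per_f0: int = 4) -> int:
    """Size of duration plus static and dynamic F0 values (assumed layout)."""
    per_state = bytes_per_duration + 2 * bytes_per_f0
    return num_phonemes * states * per_state


if __name__ == "__main__":
    seconds = 3.0
    phonemes = 30  # assumption: about 10 phonemes per second
    audio = pcm_bytes(seconds)
    prosody = prosody_bytes(phonemes)
    print(f"audio: {audio} bytes, prosody: {prosody} bytes, "
          f"ratio: {audio / prosody:.0f}x")
```

A real network synthesis service would likely send compressed audio, which narrows the gap, but under these assumptions the prosody payload is still far smaller than the waveform it helps the device synthesize.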

Description

Technical field

[0001] The invention relates to a method for optimizing local synthesis based on distributed natural rhythm. It belongs to the application field of speech synthesis and is mainly used in the synthesis systems of electronic products such as mobile phones, to improve the rhythm performance of speech synthesis and reduce network traffic.

Background technique

[0002] As information technology develops, speech synthesis is used in more and more applications. However, synthesis quality is constrained by hardware and deployment conditions such as limited device storage, unstable network environments, and user traffic limits, so the better synthesis quality that is achievable often does not reach the application product. Enlarging the training library improves the synthesized speech but increases the storage required, which makes the system difficult to fit on a terminal device. If the network cloud call method is used, the synthe...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/10
Inventor: 郜静文, 殷翔, 孙见青, 江源, 刘艳茹, 袁武文, 张鑫, 孙梦娟, 赵志伟, 吴晓如
Owner: IFLYTEK CO LTD