Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for improving synthetic voice rhythm naturalness

A technology for synthesizing speech and rhythm, applied in speech synthesis, speech analysis, instruments, etc., it can solve problems such as negative effects and inaccurate prediction results, and achieve accurate prediction of weak reading, efficient prediction of weak reading, and improved naturalness. Effect

Active Publication Date: 2016-08-24
IFLYTEK CO LTD
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there is a lot of uncertainty in stress prediction, and the prediction results are often not accurate enough, especially in texts with unlimited content, which is more likely to cause problems, and it will bring obvious negative effects when the stress information is used in an inappropriate place

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for improving synthetic voice rhythm naturalness
  • Method and system for improving synthetic voice rhythm naturalness
  • Method and system for improving synthetic voice rhythm naturalness

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0066] The existing accent prediction methods based on semantic analysis have great uncertainty, and the prediction results are often not accurate enough. The main reasons for this analysis are as follows:

[0067] 1. Generally speaking, the content words (such as nouns, verbs, etc.) occupying the vast majority of dictionaries may be reread, and it is an impossible task to list them exhaustively.

[0068] 2. It is difficult to determine stressed words only by ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a method and system for improving synthetic voice rhythm naturalness. The method comprises a step of receiving a text to be synthesized, a step of determining the basic synthesis unit sequence corresponding to the text, wherein, the basic synthesis unit sequence comprises one or more basic synthesis units, a step of determining whether each basic synthesis unit is weak reading or not, a step of obtaining the synthesis parameter model corresponding to the basic synthesis unit, carrying out weak reading processing on the synthesis parameter model corresponding to the basic synthesis unit if the basic synthesis unit is weak reading, and obtaining an updated synthesis parameter model, a step of generating the synthesis parameter model sequence corresponding to the basic synthesis unit sequence, and a step of generating continuous voice according to the synthesis parameter model sequence. By using the method and the system, the naturalness of continuous synthetic voice can be simply and effectively improved.

Description

technical field [0001] The invention relates to the technical field of speech synthesis, in particular to a method and system for improving the naturalness of synthesized speech prosody. Background technique [0002] It has become an urgent need for the application and development of information technology to realize humanized and intelligent effective interaction between man and machine, and to build an efficient and natural human-machine communication environment. Speech synthesis technology converts text information into natural voice signals, realizes real-time conversion of any text, changes the traditional cumbersome operation of realizing machine speaking through recording and playback, and saves system storage space. It plays an increasingly important role in dynamic query applications where information content needs to change frequently. [0003] In recent years, with the development of the needs of the information society, users have put forward higher requirement...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/10
Inventor 祖漪清王祖燕黄维邵鹏飞胡国平胡郁刘庆峰
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products