Correction method for Chinese speech synthesis tone

A tone correction technique for speech synthesis, applicable to speech synthesis, speech analysis, instruments, and related fields. It addresses uneven fundamental frequency within syllables and the resulting decline in intelligibility and naturalness.

Active Publication Date: 2012-06-13

北京宇音天下科技有限公司 +1

3 Cites, 25 Cited by


Problems solved by technology

[0003] In Chinese speech synthesis, the pitch accuracy of each single syllable is vital to the intelligibility and naturalness of the synthesized speech. The Hidden Markov Model, however, is a segmental model divided by state, and its segments are independent of one another. As a result, the fundamental frequency within a syllable can be uneven, which significantly reduces intelligibility and naturalness.
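The effect described in [0003] can be illustrated with a minimal sketch. It assumes a simplified view in which each HMM state contributes only its own mean F0, held constant for the state's duration, with no smoothing across state boundaries. The function names and all numeric values are hypothetical and are not taken from the patent.

```python
def statewise_f0(state_means, state_durations):
    """Build an F0 contour by holding each state's mean for its duration (in frames).

    Simplifying assumption: states are fully independent, so nothing
    smooths the contour where one state ends and the next begins.
    """
    contour = []
    for mean, duration in zip(state_means, state_durations):
        contour.extend([mean] * duration)
    return contour


def max_frame_jump(contour):
    """Return the largest absolute F0 change between adjacent frames."""
    return max(abs(b - a) for a, b in zip(contour, contour[1:]))


# Hypothetical falling-tone syllable modeled by three states (values in Hz).
means = [220.0, 180.0, 150.0]
durations = [4, 5, 3]

contour = statewise_f0(means, durations)
print(contour)
print("largest jump between frames:", max_frame_jump(contour))
```

Under these assumptions the contour is a staircase: flat inside each state, with an abrupt step at every state boundary, which is the kind of unevenness within a syllable that the patent sets out to correct.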


Embodiment Construction

[0054] The present invention is further described below with reference to the accompanying drawings and examples. The steps and processes for realizing the invention are explained through a detailed description of each key step of the method. The described examples are provided for illustration only and do not limit the present invention.

[0055] Figure 1 is a schematic diagram of the tone correction method for Chinese speech synthesis proposed by the present invention. The method is implemented in standard C and can be compiled and run on both Windows and Unix platforms. In the preferred embodiment shown in Figure 1, the method is divided into two parts: an offline training module 2 and a parametric speech synthesis module 6. The offline training module 2 is not co...

Abstract

The invention discloses a tone correction method for Chinese speech synthesis. A text analysis module receives arbitrary text to be synthesized and outputs complete synthesis label information according to the syllable and prosodic hierarchy. A parametric speech synthesis module receives the label information from the text analysis module and outputs the synthesized speech signal through a parameter generation method based on a reference tone. An offline training module is responsible for training the hidden Markov models: a reference tone model is used to generate the reference fundamental frequency envelope of each individual syllable, and a synthesis parameter model is used to obtain the synthesis parameter sequence. The invention addresses the instability of tones in hidden-Markov-model-based Chinese speech synthesis, thereby greatly improving the naturalness and prosody of the synthesized speech.
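The abstract states that a reference tone model produces a per-syllable reference fundamental frequency envelope, but this excerpt does not say how that envelope enters parameter generation. The sketch below therefore assumes, purely for illustration, a frame-wise linear blend between the generated F0 and the reference envelope. The function name `blend_with_reference`, the `weight` parameter, and all values are hypothetical and should not be read as the patented method.

```python
def blend_with_reference(generated_f0, reference_f0, weight=0.5):
    """Pull a generated F0 contour toward a reference envelope, frame by frame.

    Assumption (not from the patent): a linear blend, where weight=0.0
    keeps the generated contour and weight=1.0 returns the reference.
    """
    if len(generated_f0) != len(reference_f0):
        raise ValueError("contours must have the same number of frames")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return [
        (1.0 - weight) * g + weight * r
        for g, r in zip(generated_f0, reference_f0)
    ]


# Hypothetical four-frame syllable (values in Hz).
generated = [220.0, 220.0, 180.0, 180.0]   # stepwise contour from state means
reference = [220.0, 200.0, 180.0, 160.0]   # smooth per-syllable reference envelope

corrected = blend_with_reference(generated, reference, weight=0.5)
print(corrected)
```

With these made-up values, the largest frame-to-frame step in the generated contour shrinks after blending, which conveys the intent of using a syllable-level reference to stabilize the tone.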

Description

Technical field

[0001] The invention relates to a parametric speech synthesis method, and in particular to a tone correction method for Chinese speech synthesis.

Background art

[0002] The goal of speech synthesis technology is to make electronic devices speak like humans. As the technology has developed, the sound quality, naturalness, and intelligibility of synthesized voices have improved greatly, and the fastest progress has been in speech synthesis based on parametric statistical models. Parametric statistical speech synthesis based on the Hidden Markov Model is representative of this approach. The synthesized speech is highly coherent and flexible, and the required resources occupy little space, which gives the method great practical and research value. The method is divided into two parts: an offline model training part and an online speech synthesis part. In the offline training part...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L13/02

Inventor: 那兴宇, 王朝民, 谢湘, 何娅玲

Owner: 北京宇音天下科技有限公司
