Speech synthetic text processing method based on rhythm structure

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A technology of text processing and speech synthesis, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of low naturalness of synthesized speech and cannot reach the level acceptable to users, and achieve the effect of simplifying prosody control

Inactive Publication Date: 2007-07-18

HEILONGJIANG UNIV

View PDF0 Cites 54 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The first two speech synthesis methods will lead to relatively low naturalness of the synthesized speech output by the computer and the speech synthesis device, and the "machine smell" is too strong. This technology enters the market on a large scale

The reason is that speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic speech needs to be further improved; ② The text analysis process should be able to reflect the rhythm changes in natural speech to enrich the expressiveness of synthetic speech; ③ The prosody control process of speech synthesis should conform to the prosody of natural speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0042] Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

[0043] In conjunction with Fig. 1, the present invention includes the following computer-implementable steps:

[0044] The text regularization step converts the input text string into a legal pronunciation string according to the preset special symbol table, and outputs the legal pronunciation string to the prosodic structure analysis step;

[0045]In the prosodic structure analysis step, the received legal pronunciation strings are sent to the prosodic structure analysis module for processing, and the legal pronunciation strings are marked with prosodic structure information according to the pre-set word segmentation rules and prosodic structure analysis rules, and the prosodic structure information is output. Annotate strings to the linguistic processing step;

[0046] In the linguistic processing step, the received tagged character strings a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A method for processing voice synthetic text based on rhythm structure includes comparing inputted text with preset special symbol table to output legal pronunciation character string, comparing legal pronunciation character string according to participle rule and rhythm structure analysis rule to output labeled character string with rhythm structure information, comparing labeled character string with preset rhythm rule and phonetic table word by work and outputting label phonetic code string labeled rhythm information.

Description

(1) Technical field [0001] The invention relates to the technical field of speech signal processing, in particular to a prosodic structure-based text processing method in the speech synthesis technology. (2) Background technology [0002] The existing Chinese speech synthesis method is a word-to-sound conversion using a character as a segmentation unit, or a phrase-based text-to-speech conversion using a grammatical word as a segmentation unit. In fact, when people speak, they do not use words or grammatical words as the segmentation unit, but use prosodic words as the segmentation unit. The first two speech synthesis methods will lead to relatively low naturalness of the synthesized speech output by the computer and the speech synthesis device, and the "machine smell" is too strong. This technology has entered the market on a large scale. The reason is that speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic spee...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L13/00G10L13/08G10L13/02

Inventor张鹏王丽红

OwnerHEILONGJIANG UNIV

Speech synthetic text processing method based on rhythm structure

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology