Speech synthetic method based on rhythm character

A speech synthesis and prosody feature technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of complex prosody model and prosody control, low naturalness of synthesized speech, and unacceptable level of users.

Inactive Publication Date: 2011-03-30
HEILONGJIANG UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The first two speech synthesis methods will lead to relatively low naturalness of the synthesized speech output by the computer and the speech synthesis device, and the "machine smell" is too strong. This technology enters the market on a large scale
The reason is that speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic speech needs to be further improved; ② The text analysis process should be able to reflect the rhythm changes in natural speech to enrich the expressiveness of synthetic speech; ③ The prosody control process of speech synthesis should conform to the prosody of natural speech
In practice, it is found that the prosodic model and prosodic control of Chinese are extremely complex, and the primitive selection method based on one method cannot meet the requirements of speech synthesis for prosodic features and prosodic control in the current situation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthetic method based on rhythm character
  • Speech synthetic method based on rhythm character
  • Speech synthetic method based on rhythm character

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0115] 1. Text processing program

[0116] 1.1. Text regularization

[0117] In conjunction with Figures 2-5, the present invention processes the input text through a text regularization step, with the purpose of correcting information with special symbols such as dates, numbers, weather forecasts, and house numbers in the input text according to the correct reading method. Enter text to mark; for example: date "2000-12-12" is marked as "December 12, 2000", "minimum temperature at night -12°C" is marked as "minimum temperature at night -12°C", etc. The output of the text regularization device is a legal pronunciation character sequence, as shown in Table 1.

[0118] Table 1 Relationship between special symbols and input text

[0119]

character type

input character format

special symbol reading

Character order of legal pronunciation

List

date

2000-12-12

The first "-" is read as "year"

The second "-" is r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a speech synthesis method based on rhythm character, which can improve rhythm control method, and further raise Chinese speech synthesis naturalness. The speech synthesis method includes the following computer realizable steps: text processing program formed by text standardizing step, rhythm structure analysis step and language treatment step; synthetic element selecting program formed by element confirming step, matching step, pasting-up step, optimizing and screening step; speech synthesis processing program formed by base frequency outline generating step of phrase unit, base frequency outline generating step of syllable unit and intonation superposing step. According to Chinese voice character, Chinese tone and feature, and Chinese intonation and mode, the invention constructs a set of complete speech synthesis method based on rhythm character, wherein the steps and modules are all computer program treatment process, and have good universality and transportability, wide application scope and occasion.

Description

(1) Technical field [0001] The invention relates to the technical field of speech signal processing, in particular to a speech synthesis method based on prosodic features in the speech synthesis technology. (2) Background technology [0002] The existing Chinese speech synthesis method is a word-to-sound conversion using a character as a segmentation unit, or a phrase-based text-to-speech conversion using a grammatical word as a segmentation unit. In fact, when people speak, they do not use words or grammatical words as the segmentation unit, but use prosodic words as the segmentation unit. The first two speech synthesis methods will lead to relatively low naturalness of the synthesized speech output by the computer and the speech synthesis device, and the "machine smell" is too strong. This technology has entered the market on a large scale. The reason is that speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/00G10L13/02G10L13/04G10L13/08
Inventor 张鹏王丽红
Owner HEILONGJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products