Supercharge Your Innovation With Domain-Expert AI Agents!

Voice synthetic method, voice synthetic device and recording medium

A speech synthesis and pitch technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of fine spectrum structure that cannot represent actual speech, lack of human voice sense, poor model accuracy, etc.

Inactive Publication Date: 2005-01-19
KK TOSHIBA
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there is the problem of poor model accuracy
That is to say, only using the formant frequency and frequency bandwidth cannot express the fine structure of the spectrum of the actual speech, and the sound quality is poor and lacks the sense of human voice (human-like degree)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice synthetic method, voice synthetic device and recording medium
  • Voice synthetic method, voice synthetic device and recording medium
  • Voice synthetic method, voice synthetic device and recording medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] Embodiments of the present invention will be described below with reference to the drawings.

[0031] FIG. 1 shows the configuration of a speech synthesis device for realizing a speech synthesis method according to an embodiment of the present invention. The sound synthesis device receives the pitch pattern 306 , the phoneme duration 307 and the phoneme symbol string 308 , and outputs a synthesized speech signal 305 . The above-mentioned speech synthesis device is composed of a voiced speech synthesis unit 31 and an unvoiced speech synthesis unit 32, and generates a synthesized speech signal 305 by adding the unvoiced speech signal 304 and the spoken speech signal 303 respectively output from these synthesis units.

[0032] The unvoiced speech synthesis unit 32 refers to the phoneme duration 307 and the phoneme symbol string 308 to generate the unvoiced speech signal 304 when the phonemes are mainly unvoiced consonants and voiced fricatives. The unvoiced speech synthes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A speech synthesis method comprises selecting a predetermined formant parameters from formant parameters according to a pitch pattern, phoneme duration, and phoneme symbol string, generating a plurality of sine waves based on formant frequency and formant phase of the formant parameters selected, multiplying the sine waves by windowing functions of the selected formant parameters, respectively, to generate a plurality of formant waveforms, adding the formant waveforms to generate a plurality of pitch waveforms, and superposing the pitch waveforms according to a pitch period to generate a speech signal.

Description

[0001] Cross References to Related Applications [0002] This application is based on and claims priority from the prior Japanese Patent Application No. 2001-08704 filed on March 26, 2001, the entire contents of which are incorporated herein by reference. technical field [0003] The invention relates to text-to-speech synthesis, in particular to the speech synthesis to generate speech signals from information such as phoneme symbol strings, pitches, and phoneme durations. Background technique [0004] Making speech signals from arbitrary texts is called text-to-speech synthesis. Usually, this text-to-speech synthesis system includes three stages: a speech processing unit, a phoneme processing unit, and a speech signal generation unit. [0005] The input text is first subjected to morphological analysis and composition analysis in the speech processing unit, and then the stress and intonation are processed in the phoneme processing unit, and the phoneme symbol string, pitch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/00G10L13/06
CPCG10L25/27G10L13/04G10L13/027
Inventor 笼嶋岳彦赤岭政巳
Owner KK TOSHIBA
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More