Voice synthetic method, voice synthetic device and recording medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech synthesis and pitch technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of fine spectrum structure that cannot represent actual speech, lack of human voice sense, poor model accuracy, etc.

Inactive Publication Date: 2005-01-19

KK TOSHIBA

View PDF1 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, there is the problem of poor model accuracy

That is to say, only using the formant frequency and frequency bandwidth cannot express the fine structure of the spectrum of the actual speech, and the sound quality is poor and lacks the sense of human voice (human-like degree)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] Embodiments of the present invention will be described below with reference to the drawings.

[0031] FIG. 1 shows the configuration of a speech synthesis device for realizing a speech synthesis method according to an embodiment of the present invention. The sound synthesis device receives the pitch pattern 306 , the phoneme duration 307 and the phoneme symbol string 308 , and outputs a synthesized speech signal 305 . The above-mentioned speech synthesis device is composed of a voiced speech synthesis unit 31 and an unvoiced speech synthesis unit 32, and generates a synthesized speech signal 305 by adding the unvoiced speech signal 304 and the spoken speech signal 303 respectively output from these synthesis units.

[0032] The unvoiced speech synthesis unit 32 refers to the phoneme duration 307 and the phoneme symbol string 308 to generate the unvoiced speech signal 304 when the phonemes are mainly unvoiced consonants and voiced fricatives. The unvoiced speech synthes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A speech synthesis method comprises selecting a predetermined formant parameters from formant parameters according to a pitch pattern, phoneme duration, and phoneme symbol string, generating a plurality of sine waves based on formant frequency and formant phase of the formant parameters selected, multiplying the sine waves by windowing functions of the selected formant parameters, respectively, to generate a plurality of formant waveforms, adding the formant waveforms to generate a plurality of pitch waveforms, and superposing the pitch waveforms according to a pitch period to generate a speech signal.

Description

[0001] Cross References to Related Applications [0002] This application is based on and claims priority from the prior Japanese Patent Application No. 2001-08704 filed on March 26, 2001, the entire contents of which are incorporated herein by reference. technical field [0003] The invention relates to text-to-speech synthesis, in particular to the speech synthesis to generate speech signals from information such as phoneme symbol strings, pitches, and phoneme durations. Background technique [0004] Making speech signals from arbitrary texts is called text-to-speech synthesis. Usually, this text-to-speech synthesis system includes three stages: a speech processing unit, a phoneme processing unit, and a speech signal generation unit. [0005] The input text is first subjected to morphological analysis and composition analysis in the speech processing unit, and then the stress and intonation are processed in the phoneme processing unit, and the phoneme symbol string, pitch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L13/00G10L13/06

CPCG10L25/27G10L13/04G10L13/027

Inventor 笼嶋岳彦赤岭政巳

Owner KK TOSHIBA

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice synthetic method, voice synthetic device and recording medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology