Chinese voice synthesis method based on music instrument digital interface algorithm

A speech synthesis technology for Chinese, applied in speech synthesis, speech analysis, instruments, and related fields. It addresses unnatural transitions between speech units, high system performance requirements, and large storage requirements, and aims to ensure portability and scalability.

Status: Inactive. Publication date: 2004-04-07

FUDAN UNIV

Cites: 0. Cited by: 1.

Problems solved by technology

However, a gap remains between speech synthesized by the splicing method and natural speech. The synthesized speech is not natural enough, and the transitions between speech units are not smooth.

However, the above speech synthesis methods have inherent defects: either the synthesis algorithm is relatively complex and places high performance demands on the system, or the storage requirements are very large.

Portability and scalability are poor.

Embodiment Construction

[0051] For example, with a PC-based speech synthesizer: on Win95 or later operating systems, it is only necessary to add the corresponding speech waveform library to the wave table file (gm.dls); nothing else needs to be modified. Any software on the system that plays MIDI files can then synthesize speech signals normally.

[0052] We store one waveform of "āo" and one of "ī" in the DLS wave table in the form of a stringed instrument. The correction parameters for their other three tones are:

1. Yangping (second tone): no attack or decay segment; hold time 0.3 milliseconds; release time 0.3 milliseconds. The frequency is raised by 600 cents.
2. Shangsheng (third tone): no attack segment; decay time 0.3 milliseconds; hold time 0.3 milliseconds; no release segment. The frequency is lowered by 600 cents.
3. Qusheng (fourth tone): attack time 0.1 milliseconds; decay time 0.4 milliseconds; no hold segment; and the relea...
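The tone corrections in [0052] combine a pitch offset in cents with an envelope built from up to four segments (attack, decay, hold, and release in the usual ADSR terms). The following is a minimal Python sketch of both ideas, not the patent's implementation. The `Envelope` class, its linear segments, and the `sustain_level` value are assumptions made for illustration; times are taken in whatever unit the text uses, and a time of 0 stands for an absent segment.

```python
from dataclasses import dataclass


def cents_to_ratio(cents: float) -> float:
    """Frequency ratio for a pitch offset in cents (1200 cents = one octave)."""
    return 2.0 ** (cents / 1200.0)


@dataclass
class Envelope:
    """Piecewise-linear amplitude envelope (hypothetical helper)."""
    attack: float    # rise from 0 to peak; 0 means the segment is absent
    decay: float     # fall from peak to the sustain level
    hold: float      # time spent at the sustain level
    release: float   # fall from the sustain level to 0
    sustain_level: float = 0.7  # assumed value, not given in the source

    def level(self, t: float) -> float:
        """Amplitude in [0, 1] at time t after the note starts."""
        if t < 0:
            return 0.0
        if t < self.attack:
            return t / self.attack
        t -= self.attack
        if t < self.decay:
            return 1.0 - (1.0 - self.sustain_level) * t / self.decay
        t -= self.decay
        if t < self.hold:
            return self.sustain_level
        t -= self.hold
        if t < self.release:
            return self.sustain_level * (1.0 - t / self.release)
        return 0.0


# Second tone as listed in [0052]: no attack or decay, hold 0.3, release 0.3.
second_tone = Envelope(attack=0.0, decay=0.0, hold=0.3, release=0.3)
second_tone_ratio = cents_to_ratio(600)  # raising by 600 cents is half an octave
```

A shift of 600 cents corresponds to a frequency ratio of the square root of 2, so raising and then lowering by 600 cents returns to the original pitch.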

Abstract

The present invention is a Chinese speech synthesis method based on the MIDI algorithm. The method comprises: building a waveform library by storing Chinese speech units, comprising 23 initial consonants and 34 finals (vowels), in a DLS wavetable; appending the waveform library to the end of the standard DLS wavetable; applying ADSR correction to the finals; converting text first into pinyin and then into MIDI messages; and finally synthesizing the Chinese character speech through a sound card or another player that supports MIDI. The invention can greatly reduce the required storage space, reduce the amount of computation, and save system cost, and it has excellent portability and extensibility.
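The conversion step in the abstract (text to romanized spelling, then to MIDI messages) can be sketched in Python as follows. This is an illustration under stated assumptions, not the patent's mapping: the list of 23 initials is assumed to be the 21 standard pinyin initials plus "y" and "w", the `programs` table that assigns each speech unit to a MIDI program number is hypothetical, and tone handling is left out. Only the MIDI status bytes (0xC0 program change, 0x90 note on, 0x80 note off) are standard.

```python
# Assumed set of 23 initials; two-letter initials come first so they match
# before their one-letter prefixes ("zh" before "z").
INITIALS = ["zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l",
            "g", "k", "h", "j", "q", "x", "r", "z", "c", "s", "y", "w"]


def split_syllable(syllable: str) -> tuple[str, str]:
    """Split a toneless pinyin syllable into (initial, final)."""
    for initial in INITIALS:
        if syllable.startswith(initial) and len(syllable) > len(initial):
            return initial, syllable[len(initial):]
    return "", syllable  # syllable with no initial, e.g. "ao"


def program_change(channel: int, program: int) -> bytes:
    return bytes([0xC0 | channel, program])


def note_on(channel: int, note: int, velocity: int) -> bytes:
    return bytes([0x90 | channel, note, velocity])


def note_off(channel: int, note: int) -> bytes:
    return bytes([0x80 | channel, note, 0])


def syllable_to_midi(syllable: str, programs: dict[str, int],
                     channel: int = 0, note: int = 60,
                     velocity: int = 100) -> bytes:
    """Emit program change, note on, and note off for each speech unit.

    `programs` maps a unit (initial or final) to the program number of the
    wavetable instrument assumed to hold its waveform.
    """
    out = bytearray()
    for unit in split_syllable(syllable):
        if not unit:
            continue
        out += program_change(channel, programs[unit])
        out += note_on(channel, note, velocity)
        out += note_off(channel, note)
    return bytes(out)


# Hypothetical program numbers for two units.
example_programs = {"zh": 0, "ong": 1}
example_bytes = syllable_to_midi("zhong", example_programs)
```

Since 23 initials and 34 finals give 57 units, they fit within the 128 program numbers of a single MIDI bank, which is why one program number per unit is used in this sketch.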

Description

Technical field

[0001] The invention belongs to the technical field of Chinese speech synthesis, and specifically relates to a Chinese speech synthesis method based on a musical instrument digital interface (MIDI) algorithm.

Technical background

[0002] Speech synthesis technology converts text information, whether generated by the computer or input from outside, into a speech signal according to speech processing rules, so that the computer can read the text aloud. It draws on many fields, including acoustics, linguistics, digital signal processing, and multimedia technology, and it is a hot research area in which the world's leading countries compete. At present, speech synthesis technology mainly comprises rule-based synthesis and splicing-based synthesis. Rule-based synthesis mainly computes the rules governing parameter trajectories to complete the parametric synthesis of speech; splicing-based s...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L13/00

Inventors: 陈光梦, 李涛, 胡波

Owner: FUDAN UNIV
