Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis method and system

A technology of speech synthesis and syllables, which is applied in the field of speech synthesis methods and systems, which can solve the problems of voice quality degradation and loss of voice information in synthesized speech, and achieve the effect of full and mellow timbre and reduced data storage space

Inactive Publication Date: 2010-11-24
BEIJING SINOVOICE TECH CO LTD
View PDF5 Cites 202 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] Since the parameter model is obtained through parameter extraction and model statistics, this method can compress the storage space compared to the pre-stored voice data; however, some voice information will be lost in the process of parameter extraction and model statistics, so it will be Causes degradation of sound quality of synthesized speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and system
  • Speech synthesis method and system
  • Speech synthesis method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0056] One of the core ideas of the embodiments of the present invention is to use the spectral parameter database to store the spectral parameters of a specific syllable, so that when the user enters text online, the syllable name and context in the text can be used to plan the spectral parameters based on the statistical parameter model. The duration and fundamental frequency parameters are matched from the spectral parameter database to obtain corresponding spectral parameters, and then the voice data of the text is obtained by using a synthesizer.

[0057] refer to figure 1 , which shows a flow chart of an embodiment of a speech synthesis method in the present invention, which may specifically include:

[0058] Step 101, rece...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speech synthesis method and a speech synthesis system. The method comprises: receiving a text input by a user; performing text analysis to obtain a syllable sequence corresponding to the text and the syllable name of each syllable in the syllable sequence; for each syllable in the syllable sequence, planning and acquiring a corresponding duration parameter and a corresponding basic frequency parameter by combining a statistic parameter model according to the syllable name and context; for each syllable in the syllable sequence, acquiring corresponding spectrum parameter by matching from a spectrum parameter database according to the syllable name, the context, the duration parameter and the basic frequency parameter; and acquiring speech data corresponding to the syllable sequence by using a synthesizer according to the duration parameter, duration parameter, basic frequency parameter and spectrum parameter of each syllable in the syllable sequence. The method and the system can be used in embedded equipment and effectively reduce data storage space occupation while achieving a high tone quality.

Description

technical field [0001] The invention relates to the technical field of speech synthesis, in particular to a speech synthesis method and system. Background technique [0002] Speech synthesis technology, also known as text-to-speech (TTS, Text To Speech) technology, can convert any text information into a standard and smooth voice to read out. [0003] In the current speech synthesis, there are mainly two methods: [0004] One is the waveform splicing method; [0005] The basic idea is to pre-record a speech library, and when synthesizing, according to the results of text analysis and prosody prediction, directly select the appropriate recording segments from the speech library, and finally stitch the selected recording segments together. [0006] Due to the use of original recordings, the sound quality of synthesized speech can be guaranteed; however, to obtain better synthesis results, the speech library needs to store a large amount of speech data in advance, and the syn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/08G10L15/14G10L15/06G10L19/00
Inventor 李健张连毅武卫东
Owner BEIJING SINOVOICE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products