Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Speech synthesis method and system

A technology of speech synthesis and alternative speech, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as unbalanced data distribution and affecting the selection of text to be synthesized

Active Publication Date: 2019-10-18
科大讯飞长江信息科技有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the existing synthesis system synthesizes speech, the number of incorrectly synthesized speech units is far less than the number of correctly synthesized speech units, that is, the distribution of the two types of training data for training the classification model is unbalanced, which leads to the training of the classification model. The wrong synthetic unit tends to be the correct synthetic unit, which affects the selection of the optimal synthetic result of the text to be synthesized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and system
  • Speech synthesis method and system
  • Speech synthesis method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0093] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0094] Such as figure 1 Shown, is the flow chart of the speech synthesis method of the embodiment of the present invention, comprises the following steps:

[0095] Step 101, receiving text to be synthesized.

[0096] Step 102: Perform preprocessing on the text to be synthesized to obtain a sequence of units to be synthesized and context-related information of the units to be synthesized in the text to be synthesized.

[0097] The preprocessing mainly includes: word segmentation, part-of-speech tagging and prosodic analysis. Taking Chinese as an example, the prosody analysis results of the text to be synthesized "A love story that happened around us" are as follows:

[0098] A #love* story that happened around...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech synthesis method and a speech synthesis system. The speech synthesis method comprises steps that a to-be-synthesized text is preprocessed to acquire the to-be-synthesized unit sequence of the to-be-synthesized text and context related information of to-be-synthesized units; the optimal alternative speech units of the to-be-synthesized units are acquired from a speech database according to the context related information of the to-be-synthesized units, and are spliced together to acquire the alternative speech data of the to-be-synthesized unit sequence; alternative speech data audiometry result of audiometry personnel is acquired; correction models of different acoustic characteristics are trained according to the audiometry result; the optimal alternative speech units of the to-be-synthesized units are reacquired from the speech database according to the correction models and the context related information of the to-be-synthesized units, and are spliced together to acquired optimized speech data; and finally, the optimized speech data used as the synthesized speech data of the to-be-synthesized text is output. Artificial subjective hearing is integrated with the synthesis result of the to-be-synthesized text, and therefore speech synthesis effect is improved.

Description

technical field [0001] The invention relates to the technical field of speech synthesis, in particular to a speech synthesis method and system. Background technique [0002] It has become an urgent need for the application and development of information technology to realize humanized and intelligent effective interaction between man and machine, and to build an efficient and natural human-machine communication environment. As an important part of human-computer communication, speech synthesis technology can convert text information into natural speech signals, endow computers with the ability to speak freely like humans, and change the traditional cumbersome operation of realizing machines to speak through recording and playback. In order to make the synthesized speech more natural and more in line with human subjective hearing, a speech synthesis method that integrates human subjective listening has emerged. The specific fusion method is generally to analyze the artificial...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/10G10L13/08G10L25/69G10L25/03
Inventor 夏咸军江源王影胡国平胡郁刘庆峰
Owner 科大讯飞长江信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products