Text message based waveform concatenation speech synthesis method

A technology of waveform splicing and text information, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of unsatisfactory stability of synthesized speech, lack of consideration of the influence of text information, and poor speech rhythm performance, so as to enhance real-time performance. , high naturalness, the effect of reducing the number of

Active Publication Date: 2014-10-22
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, although this method can synthesize waveforms that are relatively close to the original speech, it is limited by the size of the corpus, and the stability of the synthesized speech is not ideal (the sound library is too large, the synthetic speech speed is slow, and it cannot be synthesized in real time; the sound library is too small. , synthetic speech is unstable), which greatly affects the sense of hearing
Moreover, the existing splicing synthesis system lacks consideration of the influence of text information on primitives when calculating the cost, and the synthesized speech is not very good in prosodic performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text message based waveform concatenation speech synthesis method
  • Text message based waveform concatenation speech synthesis method
  • Text message based waveform concatenation speech synthesis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0022] It should be noted that, in the drawings or descriptions of the specification, similar or identical parts all use the same figure numbers. Implementations not shown or described in the accompanying drawings are forms known to those of ordinary skill in the art. Additionally, while illustrations of parameters including particular values ​​may be provided herein, it should be understood that the parameters need not be exactly equal to the corresponding values, but rather may approximate the corresponding values ​​within acceptable error margins or design constraints.

[0023] The method of the present invention combines the text features of the speech to be synthesized and the original speech, and firstly performs la...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a text message based waveform concatenation speech synthesis method. The text message based waveform concatenation speech synthesis method includes steps of S1, extracting acoustic parameters and text parameters of all elements in an original voice frequency through segment cutting, and training a duration prediction model and a weight prediction model according to extracted parameters; S2, using a layered pre-selection method to primarily pre-select the elements in a corpus to obtain candidate elements by means of a target element of text analysis and a duration predicted by the duration prediction model; S3, calculating the target element, the candidate elements, and weight information predicted by the weight prediction model to obtain a target cost; calculating Integrating degrees of two adjacent elements to obtain a concatenation cost; using a viterbi searching method to search the target cost and the concatenation cost to obtain a minimum cost path so as to further obtain an optimum element and obtain synthesis speeches through smooth concatenation.

Description

technical field [0001] The invention belongs to the field of intelligent information processing, and relates to a waveform splicing system based on text information. Background technique [0002] Speech is one of the main means of human-computer interaction, and the main purpose of speech synthesis is to enable computers to generate high-definition, high-naturalness continuous speech. There are two main methods of speech synthesis. Early research mainly uses parametric speech synthesis, and the most commonly used synthesis method is based on Hidden Markov parameter speech synthesis method. As a specific implementation of statistical acoustic modeling method, this method performs Hidden Markov Modeling on the acoustic parameters of speech, reconstructs the trajectory of acoustic parameters through parameter generation algorithms, and finally invokes the speech synthesizer to generate speech waveforms . The disadvantage of this method is that the sound quality, naturalness a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/08G10L13/02
Inventor 陶建华刘善峰
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products