Sound selection method for waveform concatenation speech synthesis

A speech synthesis and waveform concatenation technology, applied in speech synthesis, speech analysis, instruments, etc. It addresses the problems that existing sound selection methods do not reflect human auditory perception and can give results inconsistent with the facts, and it achieves the effect of easier splicing.

Active Publication Date: 2014-01-22
中科极限元(杭州)智能科技股份有限公司

AI Technical Summary

Problems solved by technology

[0004] 1. Existing sound selection methods do not reflect the perception of the human ear. A high score under an existing sound selection method does not mean that the selected sound is better suited to human hearing;
[0005] 2. The sound selection method adopts the method of factor weighted superposition to se



Examples


Example Embodiment

[0019] In order to make the objectives, technical solutions, and advantages of the present invention clearer, the following further describes the present invention in detail in conjunction with specific embodiments and with reference to the accompanying drawings.

[0020] It should be noted that in the drawings and in the description, similar or identical parts use the same reference numbers. Implementations not shown or described in the drawings are those known to a person of ordinary skill in the art. In addition, although this document may give examples of parameters with specific values, the parameters need not be exactly equal to those values; they may approximate them within acceptable error tolerances or design constraints.

[0021] Figure 1 is a flow chart of a sound selection method for waveform concatenation speech synthesis according to an embodiment of the present invention, su...



Abstract

The invention discloses a sound selection method for waveform concatenation speech synthesis. The method comprises the following steps: training a hidden Markov model (HMM) on the original audio to obtain a set of acoustic models and a corresponding feature decision tree; inputting a plurality of training texts and searching the feature decision tree for the related acoustic models to obtain the corresponding target speech and target syllables; training a similarity classifier from the similarity between the target speech and the corresponding candidate primitives, together with the likelihood of each acoustic parameter of the candidate primitives under the current acoustic model; and, for an arbitrary text to be synthesized, removing the dissimilar candidate primitives with the similarity classifier, selecting the optimal primitives from the remaining candidates by a concatenation cost minimization rule, and concatenating them to obtain the synthetic speech. The method can synthesize speech of higher sound quality.
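The final step of the abstract (prune dissimilar candidates, then choose the sequence with minimum concatenation cost) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: `select_units`, the `is_similar` callable standing in for the trained similarity classifier, the `concat_cost` callable standing in for the concatenation cost, the use of Viterbi-style dynamic programming for the minimization, and the fallback that keeps all candidates when the classifier rejects every one at a position.

```python
from typing import Callable, List, Sequence, TypeVar

Unit = TypeVar("Unit")


def select_units(
    candidates: Sequence[Sequence[Unit]],
    is_similar: Callable[[int, Unit], bool],
    concat_cost: Callable[[Unit, Unit], float],
) -> List[Unit]:
    """Pick one candidate per position, minimizing total concatenation cost.

    candidates[t] holds the candidate primitives for position t.
    is_similar(t, unit) is a hypothetical stand-in for the similarity classifier.
    concat_cost(prev, cur) is a hypothetical stand-in for the join cost.
    """
    # Step 1: remove candidates that the classifier marks as dissimilar.
    pruned: List[List[Unit]] = []
    for t, cands in enumerate(candidates):
        kept = [u for u in cands if is_similar(t, u)]
        # Assumption: if every candidate is rejected, fall back to all of them.
        pruned.append(kept if kept else list(cands))

    if not pruned:
        return []
    if any(len(c) == 0 for c in pruned):
        raise ValueError("every position needs at least one candidate")

    # Step 2: dynamic programming over the remaining candidates.
    best = [0.0] * len(pruned[0])      # best cost of a path ending at each unit
    back: List[List[int]] = []         # back-pointers for positions 1..T-1
    for t in range(1, len(pruned)):
        new_best: List[float] = []
        pointers: List[int] = []
        for unit in pruned[t]:
            costs = [
                best[i] + concat_cost(prev, unit)
                for i, prev in enumerate(pruned[t - 1])
            ]
            i_min = min(range(len(costs)), key=costs.__getitem__)
            new_best.append(costs[i_min])
            pointers.append(i_min)
        best = new_best
        back.append(pointers)

    # Step 3: trace the cheapest path back to the first position.
    idx = min(range(len(best)), key=best.__getitem__)
    path = [idx]
    for pointers in reversed(back):
        idx = pointers[idx]
        path.append(idx)
    path.reverse()
    return [pruned[t][i] for t, i in enumerate(path)]
```

In this sketch a unit can be any object; for a toy run one could represent each unit by a single boundary pitch value and use the absolute pitch difference as `concat_cost`. A real system would compare spectral, pitch, and energy features at the join, and the classifier would be the one trained as described above.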

Description

Technical field

[0001] The invention relates to the field of intelligent information processing, in particular to a sound selection method for waveform concatenation speech synthesis.

Background technique

[0002] Speech is one of the main means for human beings to exchange information. Speech synthesis technology mainly enables computers to generate continuous speech with high definition and high naturalness. In the development of speech synthesis technology, early research mainly used the parametric synthesis method; later, with the development of computer technology, the waveform concatenation synthesis method appeared. With the continuous growth of the corpus, the number of candidate primitives is also increasing. How to select the best primitives for splicing according to the input text has attracted more and more attention.

[0003] The parametric speech synthesis system based on hidden Markov model and the splicing system based on primitive selection are the most m...


Application Information

IPC(8): G10L13/02
Inventor 陶建华, 张冉, 温正棋
Owner 中科极限元(杭州)智能科技股份有限公司