A sound selection method for waveform splicing speech synthesis

A technology of speech synthesis and waveform splicing, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as inconsistency with the facts, and the sound selection method does not reflect the effect of human ear perception, and achieve the effect of easy splicing

Active Publication Date: 2016-04-13
中科极限元(杭州)智能科技股份有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. The sound selection method does not reflect the perception of the human ear. Getting a high score in the existing sound selection method does not mean that a sound that is more suitable for human hearing has been selected;
[0005] 2. The sound selection method adopts the method of factor weighted superposition to select the sound, that is, the sub-costs are calculated for each feature of the primitive, and then given weights respectively, and then superimposed into a total sound selection cost to select the sound. This method assumes all factors The effect on the acceptance of primitives is linearly additive, which is clearly not true

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A sound selection method for waveform splicing speech synthesis
  • A sound selection method for waveform splicing speech synthesis
  • A sound selection method for waveform splicing speech synthesis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0020] It should be noted that, in the drawings or descriptions of the specification, similar or identical parts all use the same figure numbers. Implementations not shown or described in the accompanying drawings are forms known to those of ordinary skill in the art. Additionally, while illustrations of parameters including particular values ​​may be provided herein, it should be understood that the parameters need not be exactly equal to the corresponding values, but rather may approximate the corresponding values ​​within acceptable error margins or design constraints.

[0021] figure 1 It is a flowchart of a sound selection method for waveform splicing speech synthesis according to an embodiment of the present invent...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sound selection method for waveform concatenation speech synthesis. The method comprises the following steps of: on the basis of an original audio, carrying out hidden markov based model training so as to obtain an acoustic model set and a corresponding characteristic decision tree; inputting a plurality of training texts and on the basis of the characteristic decision tree, searching to obtain related acoustic models so as to obtain corresponding target voice and target syllables; according to similarity of the target voice and corresponding candidate primitives and likelihood probability of each acoustic parameter of the candidate primitives under a current acoustic model, training to obtain a similarity classifier; inputting a random text to be synthesized, removing the dissimilar candidate primitives on the basis of the similarity classifier, selecting the optimal primitive from the residual candidate primitives by utilizing a concatenation cost minimization rule and carrying out concatenation to obtain synthetic speech. The adoption of the method disclosed by the invention can synthesize speech with higher tone quality.

Description

technical field [0001] The invention relates to the field of intelligent information processing, in particular to a sound selection method for waveform splicing speech synthesis. Background technique [0002] Speech is one of the main means for human beings to exchange information. Speech synthesis technology mainly enables computers to generate continuous speech with high definition and high naturalness. In the development of speech synthesis technology, the early research mainly used the parametric synthesis method, and later with the development of computer technology, the synthesis method of waveform splicing appeared. With the continuous increase of the corpus, the number of candidate primitives is also increasing. How to select the best primitives for splicing according to the input text has attracted more and more attention. [0003] The parametric speech synthesis system based on hidden Markov model and the splicing system based on primitive selection are the most m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/02
Inventor 陶建华张冉温正棋
Owner 中科极限元(杭州)智能科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products