Emotional Chinese text human voice synthesis method

A synthesis method and emotional technology, applied in the field of Chinese text and human voice synthesis, can solve problems such as unnaturalness, inability to continue the interaction process, and blunt voice expression, so as to improve the performance of emotional rhythm, better effect performance, and synthesis Voice natural effect

Active Publication Date: 2018-08-03
SOUTHEAST UNIV
View PDF8 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the popularization of smart phones, speech synthesis engines are also developing very rapidly. At present, there are many mature Chinese speech synthesis applications in the domestic market, but because Chinese is a tone language with intonation, its intonation is composed of multiple It is caused by various factors, including sentence pattern, part of speech, expressed emotion, etc. It is different from the intonation of a pure intonation language, so there are many problems in dealing with Chinese intonation, which directly leads to the current Chinese speech synthesis engine getting The results of the pronunciation are relatively stiff and unnatural, and there is a big difference from the results of native Chinese speakers
Therefore, in the process of human-computer interaction, the voice expression of the machine is very blunt, making the interaction process unable to continue better.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Emotional Chinese text human voice synthesis method
  • Emotional Chinese text human voice synthesis method
  • Emotional Chinese text human voice synthesis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0041] Example: see figure 1 , a kind of Chinese text human voice synthesis method with emotion, described synthesis method comprises the following steps:

[0042] (1) Construct emotional corpus;

[0043] (2) Speech synthesis with emotion based on waveform splicing;

[0044] The concrete operations of constructing the emotional corpus described in step (1) are as follows:

[0045] (11) Word segmentation and part-of-speech tagging, based on the existing hidden Markov model, perform word segmentation and part-of-speech tagging on the original text, and convert the word segmentation results into text form, add "#" between each word as a separator, and combine the output as Word segmentation text; described step (11) word segmentation and part-of-speech tagging, specifically as follows,

[0046] The word segmentation process is divided into preprocessing, rough segmentation and part-of-speech tagging. Preprocessing includes text filtering and atomic segmentation, filtering unde...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an emotional Chinese text human voice synthesis method, which mainly comprises the steps of: (1) constructing an emotional corpus; (2) and performing emotional speech synthesisbased on waveform splicing. The emotional corpus establishment is mainly implemented by the steps of: (11) segmenting terms and acquiring parts of speech of the terms; (12) performing speech segmentation, and acquiring audio data corresponding to segmented terms based on speech data features and text corpora; (13) and performing emotion analysis, and acquiring emotional feature values of terms, clauses and whole sentences based on text term segmentation and audio features. The emotional speech synthesis based on waveform splicing is implemented by the steps of: (21) segmenting terms and performing emotion analysis on a text to be synthesized, and acquiring parts of speech of words, sentence patterns and emotional features in the text to be synthesized; (22) selecting the optimal corpus, and carrying out matching to obtain the optimal corpus set based on text eigenvalues; (23) and perfomring speech synthesis and waveform splicing, extracting a word audio sequence set from the corpus set, and synthesizing the audio to output a final speech. The emotional Chinese text human voice synthesis method is used for synthesizing and outputting a true human voice speech with emotional features.

Description

technical field [0001] The invention relates to speech synthesis technology, in particular to an emotional Chinese text human voice synthesis method. Background technique [0002] With the popularization of smart phones, speech synthesis engines are also developing very rapidly. At present, there are many mature Chinese speech synthesis applications in the domestic market, but because Chinese is a tone language with intonation, its intonation is composed of multiple It is caused by various factors, including sentence pattern, part of speech, expressed emotion, etc. It is different from the intonation of pure intonation language, so there are many problems when dealing with Chinese intonation, which directly leads to the current Chinese speech synthesis engine getting The pronunciation results of the speakers are relatively stiff and unnatural, and there is a big difference from the results of native Chinese speakers. Therefore, in the process of human-computer interaction, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G06F17/27
CPCG10L13/02G06F40/289
Inventor 沈傲东俞豪敏孔佑勇吴剑锋董涵舒华忠王坤
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products