Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text speech synthesis method after speaker emotion simulated optimization translation

A text-to-speech, speech synthesis technology, applied in the field of speech translation, can solve the problem of not being able to accurately express the speaker's emotions

Pending Publication Date: 2018-11-16
LANGOGO TECH CO LTD
View PDF6 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The technical problem to be solved by the present invention is to overcome the defect that the current speech synthesis technology converts text into speech, which simply broadcasts the text mechanically, and cannot accurately express the speaker's emotions, and provides an optimized method for simulating the speaker's emotions. Method for post-translation text-to-speech synthesis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text speech synthesis method after speaker emotion simulated optimization translation
  • Text speech synthesis method after speaker emotion simulated optimization translation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0021] Such as Figure 1-2 As shown, the present invention provides a method for simulating the emotion of the speaker to optimize the text-to-speech synthesis after translation, including a translation device connected to the business background signal, and the translation device is connected with a speech recognition interface and a voiceprint recognition interface through the business background signal , syntax analysis interface, translation interface and speech synthesis interface.

[0022] Specifically, the voice translation synthesis steps are:

[0023] Step 1: The translation device acquires the user's voice and obtains the WAV format;

[0024] Step 2: The business background analyzes the audio file to obtain frequency and speech rate parameters;

[0025] Step 3: The business background imports the voice information to the voiceprint recognition interface, and obtains parameters such as the user's gender and age through the voiceprint recognition system;

[0026] St...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text speech synthesis method after speaker emotion simulated optimization translation. Voice information of a user is obtained; The background analyzes an audio file to obtain frequency and speech speed parameters; the background is introduced to a voiceprint identification system to obtain gender and age parameters; speech is recognized to obtain text information; an emotion parameter is obtained from the text via analysis on grammar, vocabulary and sentences of the text; frequency, speech speed, gender, age and emotion features are combined, and a characteristic value is set for each feature; and the characteristic values are combined with a speech synthesis SSML grammar to set the broadcast speed, volume and word pause in the speech synthesis SSML grammar. Thus, synthesized speech broadcast of another language can reflect the emotion feature of a native language of a speaker. The mood, tone, vocabulary and grammar features of the speaker are identified, sothat the speech translated synthesis broadcast reflect the emotion of the speaker at present.

Description

technical field [0001] The invention relates to a method for speech synthesis, in particular to a method for simulating a speaker's emotion to optimize translated text and speech synthesis, and belongs to the technical field of speech translation. Background technique [0002] The current speech synthesis technology converts text into speech, but simply broadcasts the text mechanically, and cannot accurately express the speaker's emotions. The present invention recognizes the speaker's tone, intonation, words, grammar and other sound and language features, and dynamically adjusts the speech synthesis rules when the speaker's language is translated into other languages, so that the final speech synthesis report faithfully reflects the current speech the emotions of the recipient. Contents of the invention [0003] The technical problem to be solved by the present invention is to overcome the defect that the current speech synthesis technology converts text into speech, whi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L13/08G10L17/00G10L25/63
CPCG10L13/02G10L13/08G10L17/00G10L25/63
Inventor 张岩林彦熊涛
Owner LANGOGO TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products