Text speech synthesis method after speaker emotion simulated optimization translation

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A text-to-speech, speech synthesis technology, applied in the field of speech translation, can solve the problem of not being able to accurately express the speaker's emotions

Pending Publication Date: 2018-11-16

LANGOGO TECH CO LTD

View PDF6 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The technical problem to be solved by the present invention is to overcome the defect that the current speech synthesis technology converts text into speech, which simply broadcasts the text mechanically, and cannot accurately express the speaker's emotions, and provides an optimized method for simulating the speaker's emotions. Method for post-translation text-to-speech synthesis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0021] Such as Figure 1-2 As shown, the present invention provides a method for simulating the emotion of the speaker to optimize the text-to-speech synthesis after translation, including a translation device connected to the business background signal, and the translation device is connected with a speech recognition interface and a voiceprint recognition interface through the business background signal , syntax analysis interface, translation interface and speech synthesis interface.

[0022] Specifically, the voice translation synthesis steps are:

[0023] Step 1: The translation device acquires the user's voice and obtains the WAV format;

[0024] Step 2: The business background analyzes the audio file to obtain frequency and speech rate parameters;

[0025] Step 3: The business background imports the voice information to the voiceprint recognition interface, and obtains parameters such as the user's gender and age through the voiceprint recognition system;

[0026] St...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a text speech synthesis method after speaker emotion simulated optimization translation. Voice information of a user is obtained; The background analyzes an audio file to obtain frequency and speech speed parameters; the background is introduced to a voiceprint identification system to obtain gender and age parameters; speech is recognized to obtain text information; an emotion parameter is obtained from the text via analysis on grammar, vocabulary and sentences of the text; frequency, speech speed, gender, age and emotion features are combined, and a characteristic value is set for each feature; and the characteristic values are combined with a speech synthesis SSML grammar to set the broadcast speed, volume and word pause in the speech synthesis SSML grammar. Thus, synthesized speech broadcast of another language can reflect the emotion feature of a native language of a speaker. The mood, tone, vocabulary and grammar features of the speaker are identified, sothat the speech translated synthesis broadcast reflect the emotion of the speaker at present.

Description

technical field [0001] The invention relates to a method for speech synthesis, in particular to a method for simulating a speaker's emotion to optimize translated text and speech synthesis, and belongs to the technical field of speech translation. Background technique [0002] The current speech synthesis technology converts text into speech, but simply broadcasts the text mechanically, and cannot accurately express the speaker's emotions. The present invention recognizes the speaker's tone, intonation, words, grammar and other sound and language features, and dynamically adjusts the speech synthesis rules when the speaker's language is translated into other languages, so that the final speech synthesis report faithfully reflects the current speech the emotions of the recipient. Contents of the invention [0003] The technical problem to be solved by the present invention is to overcome the defect that the current speech synthesis technology converts text into speech, whi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L13/02G10L13/08G10L17/00G10L25/63

CPCG10L13/02G10L13/08G10L17/00G10L25/63

Inventor张岩林彦熊涛

OwnerLANGOGO TECH CO LTD

Text speech synthesis method after speaker emotion simulated optimization translation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology