Text-to-speech conversion method, device, electronic equipment and storage medium

A technology of text-to-speech and conversion methods, which is applied in the fields of electrical digital data processing, voice input/output, natural language data processing, etc., and can solve the problem of monotonous voice style, affecting user experience, and monotonous style that cannot well reflect the emotions of characters. Changes and other issues, to achieve the effect of rich voice tone and strong expressive force

Pending Publication Date: 2021-05-07
BEIJING VOLCANO ENGINE TECH CO LTD
View PDF7 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in the existing text-to-speech conversion process, the text can only be converted into the pronunciation of the corresponding pronunciation according to the pronunciation of each word in the novel text, and the voice style obtained after conversion is single. For texts with different emotions or expression styles, the voice There is no difference in the interpretation style of the audiobooks, which will lead to the monotonous style of audiobooks that cannot well reflect the emotional changes of the characters and affect the user experience.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text-to-speech conversion method, device, electronic equipment and storage medium
  • Text-to-speech conversion method, device, electronic equipment and storage medium
  • Text-to-speech conversion method, device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purpose, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below in conjunction with the drawings in the embodiments of the present disclosure. Obviously, the described embodiments It is a part of the embodiments of the present disclosure, but not all of them. Based on the embodiments in the present disclosure, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present disclosure.

[0026] Audiobooks have been accepted by more and more people due to their advantages of being easy to use, convenient, and not restricted by the use environment, and have become one of the main ways of reading.

[0027] In the prior art, audio books are mainly audio novels, and the generation of audio novels relies on speech synthesis techno...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

According to a text-to-speech conversion method, a device, electronic equipment and a storage medium provided by the embodiment of the invention, the method comprises the steps of recognizing and obtaining the conversation text of at least one conversation in the to-be-converted text, and determining the role to which each conversation belongs and the state text for describing the conversation state of the role when the role performs each conversation; inputting the conversation text and the state text of each conversation into a trained emotion feature recognition model, so that the trained emotion feature recognition model selects an emotion tag most similar to the emotion expressed by each conversation from a plurality of preset emotion tags according to the state text and outputs the emotion tag; and performing voice conversion processing on the to-be-converted text by using a preset voice corpus based on the emotion label of each conversation in the to-be-converted text and the affiliated role to obtain the voice information, the voice information corresponding to the to-be-converted text obtained in the embodiment of the invention is rich in voice tone, and the user experience is improved. The emotional change of each task in the to-be-converted text can be reflected, and the expressive force is high.

Description

technical field [0001] Embodiments of the present disclosure relate to the field of big data processing, and in particular, to a text-to-speech conversion method, device, electronic equipment, and storage medium. Background technique [0002] Audiobooks have been accepted by more and more people due to their advantages of being easy to use, convenient, and not restricted by the use environment, and have become one of the main ways of reading. [0003] In the prior art, audio books are mainly audio novels, and the generation of audio novels relies on speech synthesis technology. Specifically, the voice corpus can be pre-recorded, and based on the text content of the novel, the text is converted into voice and output to the user. [0004] However, in the existing text-to-speech conversion process, the text can only be converted into the pronunciation of the corresponding pronunciation according to the pronunciation of each word in the novel text. The voice style obtained afte...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/279G06F40/151G06F16/683G06F3/16
CPCG06F16/683G06F3/167
Inventor 潘俊杰
Owner BEIJING VOLCANO ENGINE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products