Speech synthesis method and device, equipment and storage medium

A technology of speech synthesis and spectrum, applied in the field of semantic synthesis, can solve the problems of low accuracy rate of synthesized speech, accumulation, model error, etc.

Pending Publication Date: 2021-02-09
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The main purpose of this application is to provide a speech synthesis method, device, computer equipment, and computer-readable storage medium, aiming at solving the problem that in the process of existing prosody control, the process of manually selecting reference speech may cause the accumulation of model errors. Technical issues with low voice accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and device, equipment and storage medium
  • Speech synthesis method and device, equipment and storage medium
  • Speech synthesis method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0029] The flow charts shown in the drawings are just illustrations, and do not necessarily include all contents and operations / steps, nor must they be performed in the order described. For example, some operations / steps can be decomposed, combined or partly combined, so the actual order of execution may be changed according to the actual situation.

[0030] Embodiments of the present application provide a speech synthesis method, device, computer equipment, and co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of semantic synthesis, and discloses a speech synthesis method and device, computer equipment and a computer readable storage medium. The method comprisesthe steps: acquiring a to-be-synthesized text, and converting the to-be-synthesized text into graph embedding vector information through a speech synthesis model, encoding the graph embedding vector information according to a graph encoder to generate corresponding first intermediate vector information, generating corresponding Mel language spectrum information according to the first intermediatevector information, and outputting voice synthesis information corresponding to the Mel language spectrum information. The specific semantic information of the text information is analyzed through thegraph auxiliary encoder to be mapped to different voice rhythms, so that the rhythm adjustment process becomes a full-automatic process, and the voice synthesis accuracy is improved. Meanwhile, the invention also relates to a blockchain technology, and the method can be applied to the fields of smart government affairs, smart education, smart medical treatment and the like, thereby further promoting the construction of smart cities.

Description

technical field [0001] The present application relates to the technical field of semantic synthesis, in particular to a speech synthesis method, device, computer equipment and computer-readable storage medium. Background technique [0002] TTS speech synthesis system (Text To Speech speech synthesis system), is an integral part of the intelligent dialogue system. Academia and industry are trying to achieve human-like speech synthesis with limited resources and time. In recent years, after the release of Google's Tacotron and Wavenet, the neural network method has become the mainstream solution in the field of speech synthesis. [0003] At present, the TTS model based on neural network has shown good synthesis effect, but in the process of speech synthesis, prosodic embedding is still a challenging task. After the prosodic vector is extracted from the Mel spectrum, it is input into the attention mechanism together with the output of the encoder at the attention mechanism of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L13/04G10L13/10G10L25/24
CPCG10L13/02G10L13/04G10L13/10G10L25/24
Inventor 孙奥兰王健宗程宁
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products