Speech synthesis method and device, storage medium and electronic equipment

A technology for speech synthesis and speech synthesis, applied in the computer field, can solve problems such as inability to synthesize realism and strong voice, and achieve the effect of improving user experience and enhancing realism

Pending Publication Date: 2022-02-25
BEIJING DA MI TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The embodiment of the present application provides a speech synthesis method, device, storage medium, and electronic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and device, storage medium and electronic equipment
  • Speech synthesis method and device, storage medium and electronic equipment
  • Speech synthesis method and device, storage medium and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0022] In order to make the purpose, technical solution and advantages of the present application clearer, the embodiments of the present application will be further described in detail below in conjunction with the accompanying drawings.

[0023] figure 1 A schematic diagram of an exemplary system architecture 100 to which the speech synthesis method or the speech synthesis apparatus of the embodiments of the present application can be applied is shown.

[0024] like figure 1 As shown, the system architecture 100 may include one or more of terminal devices 101 , 102 , and 103 , a network 104 and a server 105 . The network 104 is used to provide a communication link medium between the terminal equipment 101, 102, 103 and the server 105, and various communication client applications can be installed on the terminal equipment 101, 102, 103, such as: video recording application, video playback applications, voice interaction applications, search applications, instant messaging ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a speech synthesis method and device, a storage medium and electronic equipment, and belongs to the technical field of computers. The method comprises the following steps: converting text data into at least one phoneme sequence by a server, enabling the text data to be provided with at least one breath sound tag, performing speech synthesis processing on the at least one phoneme sequence based on a pre-trained speech synthesis model to obtain a Mel spectrum corresponding to the text data, and acquiring synthetic speech corresponding to the text data based on the Mel spectrum corresponding to the text data. The synthetic speech comprises breath sound corresponding to the at least one breath sound tag, the sense of reality of the synthetic speech is enhanced, the synthetic speech can be closer to real person speech, and then the user experience is improved.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a speech synthesis method, device, storage medium and electronic equipment. Background technique [0002] With the development of artificial intelligence technology, TTS (Text To Speech, speech synthesis technology) has also been developed. Speech synthesis technology can be used to convert text data into natural speech, and its application scenarios are relatively wide, such as: applied to speech dictionaries , news broadcast, SMS broadcast, e-book reading and other scenarios, but in related technologies, the speech synthesis process is relatively complicated, and the resulting speech is too blunt, and there is a big difference between the voice of a real person, resulting in a poor user experience. Contents of the invention [0003] Embodiments of the present application provide a speech synthesis method, device, storage medium, and electronic equipment, which can...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L13/08G10L13/02
CPCG10L13/08G10L13/02
Inventor 杨惠舒景辰梁光吴雨璇周鼎皓
Owner BEIJING DA MI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products