Speech synthesis method and device and computer readable storage medium

A speech synthesis and speech data technology, applied in the field of artificial intelligence, that achieves timbre conversion

Pending Publication Date: 2019-08-16
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

Existing talking computers can speak in only one mode or one voice, so they cannot meet demands such as reading aloud in the user's own voice.

Detailed description of the embodiments

[0040] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0041] The present invention provides a speech synthesis method. Figure 1 is a schematic flowchart of a speech synthesis method provided by an embodiment of the present invention. The method may be performed by a device, and the device may be implemented in software and/or hardware.

[0042] In this embodiment, the speech synthesis method includes:

[0043] S1. Receive voice data of a source speaker, convert the voice data of the source speaker into text content, and convert the text content into a text vector.

[0044] The present invention converts Chinese characters in the text content into text vectors through a text embedding module.

[0045] The present invention uses the text embedding module to perform word segmentation on the Chinese characters in the input text content, and then t...
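Paragraph [0045] is truncated in this copy, so the details of the text embedding module are not available. The following is only a minimal sketch of a generic embedding lookup, not the patent's module. It assumes character-level splitting in place of real word segmentation, and the names `build_vocab`, `make_embedding_table`, and `embed` are hypothetical.

```python
import random


def build_vocab(texts):
    """Map each distinct character to an integer id; id 0 is reserved for unknown characters."""
    vocab = {"<unk>": 0}
    for text in texts:
        for ch in text:
            if ch not in vocab:
                vocab[ch] = len(vocab)
    return vocab


def make_embedding_table(vocab_size, dim, seed=0):
    """Create a randomly initialised lookup table (a trained model would learn these values)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(vocab_size)]


def embed(text, vocab, table):
    """Turn text into a sequence of vectors, one per character (assumed simplification)."""
    ids = [vocab.get(ch, 0) for ch in text]
    return [table[i] for i in ids]


# Illustrative data only: a tiny vocabulary and a 4-dimensional table.
vocab = build_vocab(["语音合成", "合成语音"])
table = make_embedding_table(len(vocab), dim=4)
vectors = embed("语音合成器", vocab, table)  # "器" is not in the vocabulary
```

In a real system the segmentation step would come from a Chinese word segmenter and the table would be trained jointly with the spectrogram model; both are outside what this excerpt describes.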

Abstract

The invention relates to the technical field of artificial intelligence and discloses a speech synthesis method. The method comprises: converting speech data of a source speaker into text content, and converting the text content into a text vector; converting the text vector into a Mel spectrogram of the source speaker; obtaining a speech signal of a target speaker and converting it into Mel-frequency cepstral coefficient (MFCC) features of the target speaker; inputting the Mel spectrogram of the source speaker and the MFCC features of the target speaker into a trained spectral feature transformation model to obtain a Mel spectrogram of the target speaker; and converting the Mel spectrogram of the target speaker into speech corresponding to the text content and outputting the speech. The invention further provides a speech synthesis device and a computer-readable storage medium. The invention thereby achieves timbre conversion in a speech synthesis system.
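The data flow in the abstract can be sketched as a chain of stages. This is only an illustration of the order in which the stages feed one another, assuming each stage is supplied as a callable; the `Stages` container, the field names, and `synthesize` are hypothetical, and the string-tagging stubs stand in for real models.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class Stages:
    """Hypothetical container for the six processing stages named in the abstract."""
    speech_to_text: Callable[[Any], str]          # source speech -> text content
    text_to_vector: Callable[[str], Any]          # text content -> text vector
    vector_to_mel: Callable[[Any], Any]           # text vector -> source Mel spectrogram
    speech_to_mfcc: Callable[[Any], Any]          # target speech -> target MFCC features
    convert_spectrum: Callable[[Any, Any], Any]   # (source Mel, target MFCC) -> target Mel
    mel_to_speech: Callable[[Any], Any]           # target Mel spectrogram -> output speech


def synthesize(source_speech, target_speech, stages):
    """Run the stages in the order the abstract lists them."""
    text = stages.speech_to_text(source_speech)
    text_vector = stages.text_to_vector(text)
    source_mel = stages.vector_to_mel(text_vector)
    target_mfcc = stages.speech_to_mfcc(target_speech)
    target_mel = stages.convert_spectrum(source_mel, target_mfcc)
    return stages.mel_to_speech(target_mel)


# Stub stages that only tag their input, so the call order is visible in the result.
demo = Stages(
    speech_to_text=lambda s: f"text({s})",
    text_to_vector=lambda t: f"vec({t})",
    vector_to_mel=lambda v: f"mel({v})",
    speech_to_mfcc=lambda s: f"mfcc({s})",
    convert_spectrum=lambda m, c: f"conv({m}, {c})",
    mel_to_speech=lambda m: f"wave({m})",
)
result = synthesize("src", "tgt", demo)
```

Passing the stages in as callables keeps the sketch independent of any particular recognizer, spectrogram generator, or vocoder, none of which this excerpt specifies.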

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, and in particular to a speech synthesis method, device, and computer-readable storage medium.

Background

[0002] With the development of science and technology, computers can already speak through a speech synthesis system, producing output that ordinary users find easy to understand and accept. However, existing talking computers can only speak in one mode or in one voice, while end users often have higher requirements. For example, a user may want the computer to read aloud in the user's own voice. Existing computers clearly cannot meet such demands.

Summary of the invention

[0003] The present invention provides a speech synthesis method, device, and computer-readable storage medium, the main purpose of which is to provide a scheme that realizes timbre conversion in a speech synthesis system.

[0004] In order to ...

Application Information

IPC(8): G10L13/04, G10L13/08, G10L15/26, G10L21/013
CPC: G10L15/26, G10L13/08, G10L21/013, G10L2021/0135, G10L13/00, Y02T10/40
Inventor: 彭话易, 程宁, 王健宗
Owner: PING AN TECH (SHENZHEN) CO LTD