Speech synthesis method and device and computer readable storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of speech synthesis and speech data, applied in the field of artificial intelligence, to achieve the effect of timbre conversion

Pending Publication Date: 2019-08-16

PING AN TECH (SHENZHEN) CO LTD

View PDF6 Cites 35 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Therefore, in this case, it is obvious that the existing computer can no longer meet such needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0040] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0041] The invention provides a speech synthesis method. refer to figure 1 As shown, it is a schematic flowchart of a speech synthesis method provided by an embodiment of the present invention. The method may be performed by a device, and the device may be implemented by software and / or hardware.

[0042] In this embodiment, the speech synthesis method includes:

[0043] S1. Receive voice data of a source speaker, convert the voice data of the source speaker into text content, and convert the text content into a text vector.

[0044] The present invention converts Chinese characters in the text content into text vectors through a text embedding module.

[0045] The present invention utilizes said text embedding module to carry out participle operation on the Chinese characters in the input text content, and then t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of artificial intelligence, and discloses a speech synthesis method. The method comprises the steps that speech data of a source speaker is converted intotext content, and the text content is converted into a text vector; the text vector is converted into a Mel spectrogram of the source speaker; a speech signal of a target speaker is obtained, the speech signal of the target speaker is converted into Mel frequency cepstrum coefficient characteristics of the target speaker; the Mel spectrogram of the source speaker and the Mel frequency cepstrum coefficient characteristics of the target speaker are input into a trained spectral feature transformation model, and a Mel spectrogram of the target speaker is obtained; the Mel spectrogram of the target speaker is converted into speech corresponding of the text content, and the speech is output. The invention further provides a speech synthesis device and a computer readable storage medium. Accordingly, tone shift of the speech synthesis system can be achieved.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, in particular to a speech synthesis method, device and computer-readable storage medium. Background technique [0002] With the development of science and technology, computers can already speak through the speech synthesis system, which is easy for ordinary users to understand and accept. However, existing talking computers can only speak in one mode or in one voice. However, end users often have higher requirements. For example, the user may hope that the computer can read aloud in the user's own voice. So in this case, it is obvious that the existing computers can no longer meet such demands. Contents of the invention [0003] The present invention provides a speech synthesis method, device and computer-readable storage medium, the main purpose of which is to provide a scheme that can realize the timbre conversion of a speech synthesis system. [0004] In order to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L13/04G10L13/08G10L15/26G10L21/013

CPCG10L15/26G10L13/08G10L21/013G10L2021/0135G10L13/00Y02T10/40

Inventor彭话易程宁王健宗

OwnerPING AN TECH (SHENZHEN) CO LTD

Speech synthesis method and device and computer readable storage medium

What is AI technical title? AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document. A technology of speech synthesis and speech data, applied in the field of artificial intelligence, to achieve the effect of timbre conversion

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of speech synthesis and speech data, applied in the field of artificial intelligence, to achieve the effect of timbre conversion

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology