Speech synthesis method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech synthesis and speech technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of slow prediction speed and time-consuming

Active Publication Date: 2020-11-10

SOUNDAI TECH CO LTD

View PDF9 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] However, since each predicted voice point of Wavenet is fed back to the network to predict the next voice point, that is, only one voice point can be predicted at a time, and the same amount of calculations must be performed for the output of each voice point. Network calculation (weight calculation, convolution calculation and skip connect calculation), very time-consuming

For sampling points that are far away from the prediction point (or "sampling points with little relevance"), the neural network calculation of the same amount of calculation is also performed, resulting in a slower prediction speed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0071] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, not all of them. Based on the embodiments of the present application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present application.

[0072] The speech synthesis method provided by the embodiment of the present invention can be applied on the server or on the terminal.

[0073] The server can be an application server or a cloud server;

[0074] The terminal can be mobile phone, smart phone, notebook computer, digital broadcast receiver, personal digital assistant (PDA), tablet computer (PAD) and other user equipment (User Equipment, UE), handheld device, vehicle-mounted device, wear...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech synthesis method and device. The method comprises the following steps: after obtaining an initial voice point of an initial voice, a corresponding initial voice pointvector and a prediction time period, the initial voice point vector is input into a pre-trained voice synthesis network according to a time sequence of the initial voice point, so that the voice synthesis network sequentially outputs a prediction voice point vector of each to-be-tested moment in the prediction time period; and a preset neural network algorithm is adopted to calculate the predictedvoice point vector of each to-be-tested moment in the prediction time period, so that predicted voice in the prediction time period can be synthesized. According to the method, the calculation amountin the speech prediction process is reduced, and the speech synthesis efficiency is improved.

Description

technical field [0001] The present application relates to the technical field of speech synthesis, in particular to a speech synthesis method and device. Background technique [0002] The current mainstream technology of speech synthesis is neural network model, such as Wavenet, Wavernn, Simplernn, etc., which can realize the conversion from text to sound, that is, speech synthesis. When the Wavenet model is applied to speech synthesis, whether it is English or Mandarin Chinese, human experts can obtain the best results in the industry compared with traditional parametric or splicing systems in evaluating its naturalness. In other words, Wavenet can convert text information into smooth and fluent speech through layer-by-layer causal convolutional neural networks as output. [0003] The network structure of Wavenet is a causal convolutional network, which usually includes a 40-layer convolutional neural network (4 blocks, each block has a 10-layer neural network). In the wa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/02G10L13/04G10L13/047

CPCG10L13/02G10L13/04G10L13/047

Inventor 冯大航陈孝良

Owner SOUNDAI TECH CO LTD

Speech synthesis method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology