Speech synthesis method and device

A speech synthesis technology, applied in speech synthesis, speech analysis, instruments, and related fields, that addresses slow prediction speed and time-consuming computation.

Active Publication Date: 2020-11-10
SOUNDAI TECH CO LTD
9 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, because each voice point predicted by Wavenet is fed back into the network to predict the next one, only one voice point can be predicted at a time, and the same amount of network computation (weight calculation, convolution calculation, and skip-connection calculation) must be performed for every output voice point, which is very time-consuming. The same amount of neural network computation is also spent on sampling points that are far from the prediction point (that is, sampling points with little relevance), which further slows prediction.
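The cost described in [0004] can be illustrated with a minimal sketch. This is not Wavenet and not the patented method: `toy_network` is a hypothetical stand-in for one full forward pass, and the 16 kHz sampling rate and 8-point context are assumptions made only for this example. The point it shows is that autoregressive generation needs one complete network evaluation per output voice point.

```python
import numpy as np

def toy_network(context, weights):
    """Hypothetical stand-in for one full forward pass of the network.

    Returns the next voice point as a float in (-1, 1).
    """
    return float(np.tanh(context @ weights))

def generate(seed, n_new, weights):
    """Autoregressive generation: each new point is fed back as input."""
    receptive = len(weights)          # how many past points the toy model sees
    samples = list(seed)
    passes = 0
    for _ in range(n_new):
        context = np.array(samples[-receptive:])
        samples.append(toy_network(context, weights))  # one pass, one point
        passes += 1
    return samples, passes

rng = np.random.default_rng(0)
weights = rng.normal(scale=0.1, size=8)
seed = rng.uniform(-1.0, 1.0, size=8).tolist()

# One second of audio at an assumed 16 kHz sampling rate.
samples, passes = generate(seed, 16000, weights)
print(f"forward passes needed: {passes}")
```

Because every pass has the same cost regardless of how relevant the distant context points are, the total time grows linearly with the number of points to synthesize.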

Detailed Description of the Embodiments

[0071] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, not all of them. Based on the embodiments of the present application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present application.

[0072] The speech synthesis method provided by the embodiments of the present application can be applied on a server or on a terminal.

[0073] The server can be an application server or a cloud server.

[0074] The terminal can be a mobile phone, smartphone, notebook computer, digital broadcast receiver, personal digital assistant (PDA), tablet computer (PAD), or other user equipment (UE), a handheld device, a vehicle-mounted device, wear...

Abstract

The invention discloses a speech synthesis method and device. In the method, after an initial voice point of an initial voice, the corresponding initial voice point vector, and a prediction time period are obtained, the initial voice point vector is input into a pre-trained speech synthesis network in the time order of the initial voice point, so that the network sequentially outputs a predicted voice point vector for each to-be-tested moment in the prediction time period. A preset neural network algorithm is then applied to the predicted voice point vector of each to-be-tested moment, so that the predicted voice in the prediction time period can be synthesized. The method reduces the amount of computation in the speech prediction process and improves speech synthesis efficiency.

Description

Technical field

[0001] The present application relates to the technical field of speech synthesis, and in particular to a speech synthesis method and device.

Background

[0002] The current mainstream technology for speech synthesis is the neural network model, such as Wavenet, Wavernn, and Simplernn, which can convert text to sound, that is, perform speech synthesis. When the Wavenet model is applied to speech synthesis, whether in English or Mandarin Chinese, human evaluators rate its naturalness as the best in the industry, above traditional parametric or splicing systems. In other words, Wavenet passes text information through successive layers of a causal convolutional neural network and outputs smooth, fluent speech.

[0003] The network structure of Wavenet is a causal convolutional network, which usually includes a 40-layer convolutional neural network (4 blocks of 10 layers each). In the wa...
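To give a sense of scale for the 40-layer structure mentioned in [0003], the following sketch computes the receptive field of such a stack. The source states only the layer count (4 blocks of 10 layers); the kernel size of 2 and the dilation doubling within each block (1, 2, 4, ..., 512) are assumptions taken from the commonly described Wavenet configuration, not from this patent text.

```python
def receptive_field(blocks=4, layers_per_block=10, kernel_size=2):
    """Receptive field of stacked dilated causal convolutions.

    Assumes (not stated in the source) that dilation doubles at each
    layer within a block and resets at the start of the next block.
    """
    dilations = [2 ** i for i in range(layers_per_block)] * blocks
    # Each layer extends the receptive field by (kernel_size - 1) * dilation.
    field = 1 + sum((kernel_size - 1) * d for d in dilations)
    return field, len(dilations)

field, n_layers = receptive_field()
print(n_layers, field)  # 40 4093
```

Under these assumptions, every predicted voice point depends on roughly four thousand past points, and all 40 layers are evaluated for each of them.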

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/04, G10L13/047
CPC: G10L13/02, G10L13/04, G10L13/047
Inventor: 冯大航, 陈孝良
Owner: SOUNDAI TECH CO LTD