Unlock instant, AI-driven research and patent intelligence for your innovation.

Chinese speech synthesis method and device, equipment and storage medium

A speech and Chinese technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as difficult parallel processing, long training time, and affecting synthesis quality, so as to enhance model expressiveness and generalization ability, and improve speech quality , the effect of speeding up the training speed

Active Publication Date: 2019-07-30
PING AN TECH (SHENZHEN) CO LTD
View PDF6 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, for Chinese speech synthesis, when performing speech synthesis, the words and sentences are unnatural, the voice is dull, and the sense of rhythm is poor, which affects the quality of synthesis. There is a significant difference between the synthesized voice and the real human voice
[0004] Nowadays, long short term memory (LSTM) and other recurrent neural network (RNN) structures are commonly used in speech synthesis, which leads to the need to rely on the results of the previous time step during training, which is difficult to parallelize. too long

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese speech synthesis method and device, equipment and storage medium
  • Chinese speech synthesis method and device, equipment and storage medium
  • Chinese speech synthesis method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The invention provides a method, device, equipment and storage medium for synthesizing Chinese speech, which are used to reduce the training time, enhance the expressiveness and generalization ability of the model, and further improve the quality of the synthesized speech.

[0028] In order to enable those skilled in the art to better understand the solutions of the present invention, the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.

[0029] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above drawings are used to distinguish similar objects, and not necessarily Used to describe a specific sequence or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments described herein can be practiced in sequences other than those illustrat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of language signal processing in the field of artificial intelligence, and discloses a Chinese speech synthesis method and devices, equipment and a storage medium. The training time is shortened, the model expressiveness and generalization ability are enhanced, and the synthesized speech quality is further improved. The method comprises the steps that an initialMeier spectrum and the a target vector are obtained; the target vector is processed to obtain a first sequence which is a two-dimensional tensor; the initial Meier spectrum is processed to obtain a target Meier spectrum; the target corresponding relationship between the first sequence and the target Meier spectrum in each subspace is determined; speech synthesis is conducted according to a self-attention mechanism and the target corresponding relationship, and the target speech is obtained.

Description

technical field [0001] The invention relates to the field of speech signal processing, in particular to a method, device, equipment and storage medium for synthesizing Chinese speech. Background technique [0002] At present, most speech synthesis research at home and abroad is aimed at text-to-speech conversion systems, and can only solve the problem of converting written language into spoken language output in a certain reading style, lacking the performance of different age, gender characteristics, tone, and speech speed, let alone endowed Personal emotional color. With the development of the needs of the information society, higher requirements are put forward for human-computer interaction, and the research of human-computer oral dialogue system is also mentioned on the agenda. [0003] Speech synthesis research has begun to develop from the conversion stage of text to speech to the conversion stage of concept to speech. This not only puts forward higher requirements ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/02G10L25/03
CPCG10L13/02G10L25/03Y02D30/70
Inventor 陈闽川马骏王少军
Owner PING AN TECH (SHENZHEN) CO LTD