Speech synthesis model training method and device thereof, electronic equipment and storage medium

A speech synthesis model training technique, applied in speech synthesis, speech analysis, instruments, etc., that addresses low synthesis accuracy and unnatural synthesized audio, achieving higher synthesis accuracy and high practical value.

Pending Publication Date: 2020-07-10

GUANGZHOU HUYA TECH CO LTD

Cites: 5; Cited by: 7

Problems solved by technology

[0003] In view of this, the purpose of this application is to provide a speech synthesis model training method and device, electronic equipment, and a storage medium that mitigate the unnaturalness of synthesized audio caused by low synthesis accuracy in existing speech synthesis technology.


Embodiment Construction

[0047] To make the purposes, technical solutions, and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the accompanying drawings. The described embodiments are only some of the embodiments of the present application, not all of them. The components of the embodiments, as generally described and illustrated in the figures herein, may be arranged and designed in a variety of different configurations.

[0048] Accordingly, the following detailed description of the embodiments of the application provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments in this application, all other embodiments obtained by persons of ord...

Abstract

The invention provides a speech synthesis model training method and device, electronic equipment, and a storage medium, and relates to the technical field of speech synthesis. The method comprises the following steps: first, performing first processing on acquired audio sample data to obtain corresponding naturalness information; second, performing second processing on the audio sample data to obtain corresponding first phoneme information, and performing identification processing on the first phoneme information to obtain corresponding second phoneme information; then, training a pre-constructed neural network model on the naturalness information and the second phoneme information to obtain a speech synthesis model, which is used to convert input target text data into target audio data. The method mitigates the problem in existing speech synthesis technology that synthesized audio sounds unnatural because synthesis accuracy is low.
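The data flow in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the abstract does not say what the first, second, or identification processing actually compute, so the bodies below are toy stand-ins (frame-level mean amplitude for naturalness, an annotation string for phonemes, integer IDs for the identified phonemes). The names `AudioSample`, `TrainingExample`, `build_training_set`, and `train` are hypothetical, and the neural network is reduced to a caller-supplied update step.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class AudioSample:
    """Hypothetical container: a waveform plus a phoneme annotation."""
    waveform: Sequence[float]
    phonemes: str  # assumed to ship with the sample, e.g. "n i"


@dataclass
class TrainingExample:
    naturalness: List[float]  # result of the first processing
    phoneme_ids: List[int]    # the "second phoneme information"


def first_processing(sample: AudioSample, frame: int = 4) -> List[float]:
    """Toy naturalness feature (assumption): mean absolute amplitude per frame."""
    w = list(sample.waveform)
    frames = [w[i:i + frame] for i in range(0, len(w), frame)]
    return [sum(abs(x) for x in f) / len(f) for f in frames]


def second_processing(sample: AudioSample) -> List[str]:
    """Toy first phoneme information (assumption): the annotated symbols."""
    return sample.phonemes.split()


def identification_processing(phonemes: List[str], vocab: Dict[str, int]) -> List[int]:
    """Toy second phoneme information (assumption): integer IDs, growing vocab."""
    return [vocab.setdefault(p, len(vocab)) for p in phonemes]


def build_training_set(samples: Sequence[AudioSample], vocab: Dict[str, int]) -> List[TrainingExample]:
    """Run both processing branches on every sample and pair the results."""
    return [
        TrainingExample(
            naturalness=first_processing(s),
            phoneme_ids=identification_processing(second_processing(s), vocab),
        )
        for s in samples
    ]


def train(update: Callable[[TrainingExample], None],
          examples: Sequence[TrainingExample], epochs: int = 1) -> None:
    """Feed each example to a caller-supplied update step; the model is out of scope."""
    for _ in range(epochs):
        for example in examples:
            update(example)
```

In a real system the `update` callable would wrap one optimization step of the pre-constructed neural network; here any function that accepts a `TrainingExample` (even `list.append`) is enough to exercise the pipeline.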

Description

Technical field

[0001] The present application relates to the technical field of speech synthesis, and in particular to a speech synthesis model training method and device, electronic equipment, and a storage medium.

Background art

[0002] As speech synthesis technology continues to develop, its range of applications keeps widening, and users expect more and more of synthesized speech. However, the inventors have found that a trained speech synthesis model does not recognize data accurately enough, so the synthesized audio sounds unnatural because of the low synthesis accuracy.

Summary of the invention

[0003] In view of this, the purpose of this application is to provide a speech synthesis model training method and device, electronic equipment, and a storage medium to mitigate the unnaturalness of synthesized audio caused by low synthesis accuracy in the existing speech synt...

Application Information

IPC(8): G10L13/02, G10L25/03, G10L25/24, G10L25/30

CPC: G10L13/02, G10L25/24, G10L25/30, G10L25/03

Inventor: 周阳

Owner: GUANGZHOU HUYA TECH CO LTD
