Check patentability & draft patents in minutes with Patsnap Eureka AI!

Speech synthesis model training method and device thereof, electronic equipment and storage medium

A technology of speech synthesis and model training, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as low synthesis accuracy and unnatural synthesized audio, and achieve high accuracy, high practical value, and high synthesis accuracy.

Pending Publication Date: 2020-07-10
GUANGZHOU HUYA TECH CO LTD
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In view of this, the purpose of this application is to provide a speech synthesis model training method and device, electronic equipment and storage media to improve the unnatural problem of synthesized audio in the existing speech synthesis technology due to the low synthesis accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis model training method and device thereof, electronic equipment and storage medium
  • Speech synthesis model training method and device thereof, electronic equipment and storage medium
  • Speech synthesis model training method and device thereof, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is only a part of the embodiments of the present application, but not all the embodiments. The components of the embodiments of the application generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations.

[0048] Accordingly, the following detailed description of the embodiments of the application provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments in this application, all other embodiments obtained by persons of ord...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speech synthesis model training method and a device thereof, electronic equipment and a storage medium, which relate to the technical field of speech synthesis. The method comprises the following steps: firstly, carrying out first processing on acquired audio sample data to obtain corresponding naturalness information; secondly, performing second processing on the audio sample data to obtain corresponding first phoneme information, and performing identification processing on the first phoneme information to obtain corresponding second phoneme information. Then, a pre-constructed neural network model is trained based on the naturalness information and the second phoneme information to obtain a speech synthesis model, and the speech synthesis model is used for converting input target text data into target audio data; by means of the method, the problem that in an existing speech synthesis technology, due to the fact that the synthesis accuracy is low, the synthesized audio is unnatural can be solved.

Description

technical field [0001] The present application relates to the technical field of speech synthesis, in particular, to a speech synthesis model training method and device, electronic equipment and storage media. Background technique [0002] With the continuous development of the speech synthesis technology, its application range is also wider and wider, making users have higher and higher requirements for synthesized speech. However, the inventors have found that the recognition accuracy of the trained speech synthesis model for data is not high, so that when synthesizing speech, there is a problem that the synthesized audio is not natural due to the low synthesis accuracy. Contents of the invention [0003] In view of this, the purpose of this application is to provide a speech synthesis model training method and device, electronic equipment and storage media to improve the unnatural problem of synthesized audio due to the low synthesis accuracy in the existing speech synt...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/02G10L25/03G10L25/24G10L25/30
CPCG10L13/02G10L25/24G10L25/30G10L25/03
Inventor 周阳
Owner GUANGZHOU HUYA TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More