Voice synthesis model training method and device, storage medium and electronic equipment

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech synthesis and model technology, applied in speech synthesis, speech analysis, speech recognition and other directions, can solve the problems of cumbersome training process and affect the accuracy of training efficiency model, and achieve the effect of improving training efficiency

Pending Publication Date: 2021-02-02

BEIJING DA MI TECH CO LTD

View PDF0 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In related technologies, the speech synthesis model can be used not only in the application scenario of a single speaker, but also in the application scenario of multiple speakers. However, the training process of the multi-person speech synthesis model is relatively cumbersome, especially when adding Due to insufficient data and other reasons, the training efficiency and the accuracy of the model are affected

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0043] In order to make the purpose, features, and advantages of the embodiments of the present application more obvious and understandable, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, The described embodiments are only some of the embodiments of the present application, but not all of them. Based on the embodiments in this application, all other embodiments obtained by those skilled in the art without making creative efforts belong to the scope of protection of this application.

[0044] When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention discloses a voice synthesis model training method. The method comprises the following steps: obtaining the first voice data of a target user, determining the second voice data with the maximum similarity with the first voice data in a voice data set based on a speaker classification network, and training an initial speech synthesis model based on the second speech data to obtain a target speech synthesis model. When a new target user is trained for the speech synthesis model, speech data most similar to the speaking style of the target user is found in an existing speech data set to train the initial speech synthesis model to obtain the target speech synthesis model, the initial speech synthesis model is a multi-person speech synthesis model, and the training efficiency of the multi-person speech synthesis model is improved.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to a training method, device, storage medium and electronic equipment of a speech synthesis model. Background technique [0002] With the development of artificial intelligence technology, people pay more and more attention to speech synthesis technology. Synthesized speech is applied in various occasions, such as: speech broadcast on public transportation, replacing the teacher's roll call and reading questions in online teaching courses, etc. Weather broadcast, news broadcast and other occasions related to speech synthesis. In related technologies, the speech synthesis model can be used not only in the application scenario of a single speaker, but also in the application scenario of multiple speakers. However, the training process of the multi-person speech synthesis model is relatively cumbersome, especially when adding However, due to insufficient data and other reaso...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L13/02G10L13/033G10L13/08G10L15/02G10L15/06

CPCG10L13/02G10L15/063G10L13/08G10L13/033G10L15/02

Inventor吴雨璇舒景辰梁光周鼎皓杨惠

OwnerBEIJING DA MI TECH CO LTD

Voice synthesis model training method and device, storage medium and electronic equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology