Speech synthesis model training method and device, computer equipment and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of speech synthesis and model training, which is applied in the field of computer equipment, storage media, devices, and speech synthesis model training methods, can solve the problems of low degree of voice anthropomorphism, achieve the effect of increasing the degree of anthropomorphism and improving the similarity of voiceprints

Active Publication Date: 2022-07-22

PING AN TECH (SHENZHEN) CO LTD

View PDF9 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The embodiment of the present invention provides a speech synthesis model training method, device, computer equipment and storage medium, which solves the problem that the speech generated by the existing speech synthesis model has a low degree of anthropomorphism

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0035] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0036] The speech synthesis model training method provided in the embodiment of the present invention can be applied to, for example, figure 1 in the application environment shown. like figure 1 As shown, the client (computer device) communicates with the server over the network. Among them, the client, also known as the client, refers to the program that corresponds to the server and provides local services for the client. The client (com...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of artificial intelligence, and discloses a speech synthesis model training method and device, computer equipment and a storage medium. The method comprises the following steps: obtaining original text data and a speaker identifier, processing the original text data, obtaining an original text vector and a corresponding original phoneme vector, and carrying out feature enhancement processing on the original text vector and the original phoneme vector to obtain a target text vector and a target tone vector which are more significant in vectors; based on the speaker identifier, obtaining a corresponding target voiceprint vector, splicing the target voiceprint vector, the target text vector and the target tone vector, and training a speech synthesis model by using the spliced target hidden vector to obtain a target speech synthesis model corresponding to the speaker identifier, therefore, the similarity between the voice data synthesized by the updated target voice synthesis model and the voiceprint of the speaker is improved, and the personification degree of the target voice synthesis model is increased.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, and in particular, to a speech synthesis model training method, device, computer equipment and storage medium. Background technique [0002] Speech synthesis is to convert the input text sequence into the corresponding natural speech pronunciation, which is an important speech processing task in the process of human-computer interaction. In recent years, speech synthesis technology based on deep neural network has achieved remarkable synthesis effect. With the rapid development of artificial intelligence industry, speech synthesis system has also been widely used. In addition to intelligibility, the requirements for the naturalness, rhythm and sound quality of speech synthesis are also getting higher and higher. [0003] Using deep models for speech synthesis requires consideration of text and corresponding speech. Usually, a large amount of training data is required in t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L13/02G10L13/08G10L25/30

CPCG10L13/02G10L13/08G10L25/30

Inventor张旭龙王健宗程宁

OwnerPING AN TECH (SHENZHEN) CO LTD

Speech synthesis model training method and device, computer equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology