Speech synthesis method, device and equipment and computer readable storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech synthesis and phoneme technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of low fidelity of synthetic speech and low anthropomorphic degree of synthetic speech, and achieve the effect of improving fidelity and anthropomorphism

Pending Publication Date: 2021-12-24

TENCENT TECH (SHENZHEN) CO LTD

View PDF0 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Although in the speech synthesis process, the naturalness of the synthesized speech can be improved by using contextual text and speech information in the speech synthesis process, or by using a contextual acoustic encoder, however, in the related art, a fixed style is still used to Synthetic speech so that the resulting synthetic speech is less anthropomorphic, resulting in a less realistic synthetic speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

preparation example Construction

[0117] see Figure 4 , Figure 4 It is an optional flowchart of the speech synthesis method provided by the embodiment of this application Figure II . In some embodiments of the present application, based on the sentence text, a text feature with a spontaneous behavior label is constructed, that is, the specific implementation process of S102 may include: S1021-S1024, as follows:

[0118] S1021. Perform text feature extraction at the phoneme level on each character information included in the sentence text to obtain text input features of the sentence text.

[0119] The sentence text contains at least one character information, that is, the sentence text is composed of at least one character information. Speech synthesis equipment can use the word segmenter to disassemble the sentence text into individual character information, and then extract text features at the phoneme level for each character information, and use the phoneme-level text features extracted from each cha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a speech synthesis method, device and equipment and a computer readable storage medium, and relates to a speech technology in the field of artificial intelligence. The method comprises the steps: obtaining a statement text, wherein the statement text records dialogue content waiting for speech synthesis at the current moment; based on the statement text, constructing text features with a spontaneous behavior tag, wherein the spontaneous behavior tag indicates the occurrence position and type of the spontaneous acoustic behavior in the dialogue content; performing feature conversion on the text features to obtain acoustic features corresponding to the statement text; and generating a synthetic speech with a spontaneous acoustic behavior corresponding to the statement text by using the acoustic features. According to the invention, the vivid degree of the synthetic speech can be improved.

Description

technical field [0001] The present application relates to speech technology in the field of artificial intelligence, and in particular to a speech synthesis method, device, equipment and computer-readable storage medium. Background technique [0002] Speech synthesis technology is a technology for generating artificial voice, which can be applied in intelligent customer service, robots and other fields. Although in the speech synthesis process, the naturalness of the synthesized speech can be improved by using contextual text and speech information in the speech synthesis process, or by using a contextual acoustic encoder, however, in the related art, a fixed style is still used to Synthetic speech so that the resulting synthetic speech is less anthropomorphic, ultimately resulting in a less realistic synthetic speech. Contents of the invention [0003] Embodiments of the present application provide a speech synthesis method, device, device, and computer-readable storage ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/02G10L13/04G10L13/08

CPCG10L13/02G10L13/08G10L13/04

Inventor 阳珊胡娜李广之苏丹

Owner TENCENT TECH (SHENZHEN) CO LTD

Speech synthesis method, device and equipment and computer readable storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

preparation example Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology