Embedded speech synthesis method, device, controller and medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech synthesis and embedded technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of unnatural sound, large storage space, occupation, etc., to solve the problem of unnatural speech, reduce storage space requirements, and widely use value Effect

Active Publication Date: 2022-05-17

YUTOU TECH HANGZHOU

View PDF13 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

It can be seen that the existing embedded speech synthesis technology has at least the following disadvantages: first, the storage of sound clips on the embedded device still takes up a lot of storage space, and second, the voice spliced out is not natural enough.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

preparation example Construction

[0057] The embodiment of the present invention provides an embedded speech synthesis method, such as figure 1 shown, including:

[0058] Step S1, obtaining the text information to be played of the embedded device;

[0059] Wherein, the text information to be played is text information to be synthesized into speech, and the text information to be played may be text information directly input by the user through the embedded device, or may be converted by the user through voice interaction with the embedded device. text information, etc.

[0060] Step S2, obtaining a plurality of linguistic feature trees corresponding to the text information to be played from the database of the embedded device;

[0061] Step S3, merging the plurality of linguistic feature trees into one target linguistic feature tree according to the text ranking of the text information to be played;

[0062] Step S4, synthesizing the target linguistic feature tree into speech.

[0063] Among them, the acoust...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention relates to an embedded speech synthesis method, device, controller and medium. The method includes acquiring text information to be played of an embedded device; acquiring the text information corresponding to the text information to be played from a database of the embedded device. multiple linguistic feature trees; merging the multiple linguistic feature trees into a target linguistic feature tree according to the text sorting of the text information to be played; synthesizing the target linguistic feature tree into speech. The invention reduces the required storage space on the embedded device and improves the quality of the embedded speech synthesis.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to an embedded speech synthesis method, device, controller and medium. Background technique [0002] Embedded device synthesis does not require arbitrary text synthesis in many scenarios, and usually only needs to do text synthesis in related fields. The computing resources and storage resources of embedded systems are much less than those in the cloud. With a small amount of resources, text conversion must be compromised. The quality of voice (also known as speech synthesis, TTS for short in English) can be completely offline. It can be seen that, in the prior art, it is still relatively difficult to implement a set of high-quality TTS on an embedded device without a network. [0003] The speech synthesis engine can usually be divided into a front-end engine and a back-end engine. The front-end can be understood as mapping characters to phonemes and other artificial ling...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L13/02G10L13/04G10L13/08

CPCG10L13/02G10L13/08G10L13/04

Inventor郑杰文

OwnerYUTOU TECH HANGZHOU

Embedded speech synthesis method, device, controller and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

preparation example Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology