Method and apparatus for synthesizing a speech with information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for synthesizing speech and speech, which is applied in the field of information processing, can solve the problems of adding watermarks, etc., and achieve the effects of ensuring privacy, low complexity, and reducing the amount of calculation

Active Publication Date: 2013-02-27

KK TOSHIBA

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, from a general point of view, speech synthesis and digital watermarking are two different systems that perform different functions, that is, speech watermarking technology analyzes the speech on the basis of the synthesized speech and then restores it after adding the watermark or adopts some other methods. without adding a watermark while synthesizing speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] Various preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0031] Method for Synthesizing Speech with Information

[0032] figure 1 is a flowchart of a method for synthesizing speech with information according to an embodiment of the present invention. The present embodiment will be described below with reference to this figure.

[0033] Such as figure 1 As shown, first, in step 101, a text sentence is input. In this embodiment, the input text sentence may be any text sentence known to those skilled in the art, and may also be a text sentence in various languages, such as Chinese, English, Japanese, etc., and the present invention has no limitation on this.

[0034] Next, in step 105, text analysis is performed on the input text sentence to extract linguistic information from the input text sentence. In this embodiment, the linguistic information includes context information, specificall...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

According to one embodiment, an apparatus for synthesizing a speech, comprises an inputting unit configured to input a text sentence, a text analysis unit configured to analyze the text sentence so as to extract linguistic information, a parameter generation unit configured to generate a speech parameter by using the linguistic information and a pre-trained statistical parameter model, an embedding unit configured to embed information into the speech parameter, and a speech synthesis unit configured to synthesize the speech parameter with the information embedded by the embedding unit into a speech with the information.

Description

technical field [0001] The present invention relates to information processing technology, specifically to speech synthesis technology, and more specifically to the technology of embedding information in the process of speech synthesis. Background technique [0002] At present, the speech synthesis system has been applied in many aspects and provides convenience for people's life. However, these synthesized voices are seldom protected by copyright, unlike many other audio products that are well protected by digital watermarking technology. The synthesized voice usually comes from the voice database recorded by professional announcers and forms the required voice through complex synthesis algorithms. In fact, his / their voices themselves should also be protected by copyright. In addition, in many applications, synthesized speech needs to embed some supplementary information to enrich its use and ensure that the embedded information has the least impact on the speech signal, s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L13/02G10L19/018G10L19/00

CPCG10L19/018G10L13/02

Inventor汪曦栾剑李健

OwnerKK TOSHIBA

Method and apparatus for synthesizing a speech with information

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology