Method and apparatus for synthesizing a speech with information

A speech synthesis technology, applied in the field of information processing, that addresses the problem of adding watermarks to synthesized speech and achieves the effects of ensuring privacy, low complexity, and a reduced amount of calculation.

Active Publication Date: 2013-02-27
KK TOSHIBA

AI Technical Summary

Problems solved by technology

However, from a general point of view, speech synthesis and digital watermarking are two different systems that perform different functions. That is, existing speech watermarking technology analyzes the already synthesized speech and then restores it after adding the watermark, or adopts some other method; it does not add the watermark while the speech is being synthesized.

Embodiment Construction

[0030] Various preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0031] Method for Synthesizing Speech with Information

[0032] Figure 1 is a flowchart of a method for synthesizing speech with information according to an embodiment of the present invention. The present embodiment will be described below with reference to this figure.

[0033] As shown in Figure 1, first, in step 101, a text sentence is input. In this embodiment, the input text sentence may be any text sentence known to those skilled in the art, and may be in any language, such as Chinese, English, or Japanese; the present invention places no limitation on this.

[0034] Next, in step 105, text analysis is performed on the input text sentence to extract linguistic information from the input text sentence. In this embodiment, the linguistic information includes context information, specificall...

Abstract

According to one embodiment, an apparatus for synthesizing a speech, comprises an inputting unit configured to input a text sentence, a text analysis unit configured to analyze the text sentence so as to extract linguistic information, a parameter generation unit configured to generate a speech parameter by using the linguistic information and a pre-trained statistical parameter model, an embedding unit configured to embed information into the speech parameter, and a speech synthesis unit configured to synthesize the speech parameter with the information embedded by the embedding unit into a speech with the information.
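The five units named in the abstract can be pictured as a simple pipeline. The following is a minimal sketch, not the patented method: the visible text does not say how the information is embedded, so the sketch substitutes quantization index modulation (QIM) on a per-unit pitch value as a stand-in technique. All function names, the toy "model", the step size `STEP_HZ`, and the sine-wave synthesizer are hypothetical and chosen only for illustration.

```python
import math

STEP_HZ = 2.0  # hypothetical quantization step used by the toy embedder


def analyze_text(sentence):
    """Toy text analysis: one unit per word, with neighbouring-word context."""
    words = sentence.split()
    return [
        {
            "word": w,
            "prev": words[i - 1] if i > 0 else None,
            "next": words[i + 1] if i + 1 < len(words) else None,
        }
        for i, w in enumerate(words)
    ]


def generate_parameters(units, base_f0=120.0):
    """Stand-in for a pre-trained statistical model: one F0 value (Hz) per unit."""
    return [base_f0 + 5.0 * (len(u["word"]) % 7) for u in units]


def _nearest(value, bit, step):
    """Nearest point on the quantization lattice assigned to `bit`."""
    offset = bit * step / 2.0
    return round((value - offset) / step) * step + offset


def embed_bits(f0_values, bits, step=STEP_HZ):
    """Assumed embedding (QIM): snap each F0 value to the lattice for its bit."""
    out = list(f0_values)
    for i, bit in enumerate(bits[: len(out)]):
        out[i] = _nearest(out[i], bit, step)
    return out


def extract_bits(f0_values, count, step=STEP_HZ):
    """Recover bits by checking which lattice each F0 value lies closest to."""
    bits = []
    for value in f0_values[:count]:
        err0 = abs(value - _nearest(value, 0, step))
        err1 = abs(value - _nearest(value, 1, step))
        bits.append(0 if err0 <= err1 else 1)
    return bits


def synthesize(f0_values, sample_rate=8000, frame_samples=80):
    """Toy synthesizer: a phase-continuous sine wave, one frame per F0 value."""
    samples, phase = [], 0.0
    for f0 in f0_values:
        for _ in range(frame_samples):
            phase += 2.0 * math.pi * f0 / sample_rate
            samples.append(math.sin(phase))
    return samples


units = analyze_text("hello world again")       # text analysis unit
params = generate_parameters(units)             # parameter generation unit
marked = embed_bits(params, [1, 0, 1])          # embedding unit
speech = synthesize(marked)                     # speech synthesis unit
recovered = extract_bits(marked, 3)             # read back from the parameters
```

The point of the sketch is the ordering: the information is written into the speech parameters before the waveform is generated, rather than added to a finished waveform afterwards. In this toy version each parameter moves by at most half a quantization step, and the bits are read back from the parameters directly; recovering them from the waveform would additionally require re-estimating the parameters, which is outside the scope of this sketch.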

Description

Technical field

[0001] The present invention relates to information processing technology, specifically to speech synthesis technology, and more specifically to technology for embedding information during the process of speech synthesis.

Background

[0002] At present, speech synthesis systems have been applied in many areas and bring convenience to people's lives. However, synthesized voices are seldom protected by copyright, unlike many other audio products that are well protected by digital watermarking technology. A synthesized voice usually comes from a voice database recorded by professional announcers and is turned into the required speech through complex synthesis algorithms. In fact, the announcers' voices themselves should also be protected by copyright. In addition, in many applications, synthesized speech needs to embed some supplementary information to enrich its use and ensure that the embedded information has the least impact on the speech signal, s...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L13/02, G10L19/018, G10L19/00
CPC: G10L19/018, G10L13/02
Inventor: 汪曦, 栾剑, 李健
Owner: KK TOSHIBA