Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis apparatus

一种声音合成、声音的技术,应用在语音合成、语音分析、仪器等方向,能够解决信息丢失、信息丢失或掩埋、音质恶化等问题

Inactive Publication Date: 2006-08-30
PANASONIC INTELLECTUAL PROPERTY CORP OF AMERICA
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, in the transmission line where the bandwidth of transmission such as telephone lines is limited to the main frequency band of the sound signal, there is a possibility that additional information will be lost during the transmission process or additional information will be added to the band that is not lost, that is, the sound signal within the main frequency band, there is a possibility of causing serious deterioration in sound quality
[0014] In addition, the conventional method of deforming a specific cycle of a waveform when synchronizing a cycle of a waveform with a tone mark is not affected by the frequency band of the transmission line, but it must be limited to a short time unit of one cycle. Control, and the amount of deformation of the waveform must also be a small deformation that is not perceived by people as deteriorating the sound quality and is not noticed by people. Therefore, there is a possibility that additional information may be lost or lost during the process of digital / analog conversion or transmission. Problems Buried in Signal Noise

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis apparatus
  • Speech synthesis apparatus
  • Speech synthesis apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] Embodiments of the present invention will be described below with reference to the drawings.

[0045] (Embodiment 1)

[0046] figure 2 It is a functional block diagram of the speech synthesis device and the synthesized speech judgment device according to Embodiment 1 of the present invention.

[0047] figure 2 Among them, the voice synthesis device 200 is a device that converts input text into voice, and is composed of a language processing unit 201, a prosody generation unit 202, and a waveform generation unit 203. The language processing unit 201 language analyzes the input text, and determines the morphological elements and Pronunciation and intonation (accent) corresponding to the syntactic structure, output pronunciation and stress position, sentence sentence reading and dependency information; prosody generation unit 202 outputs the pronunciation and accent position, sentence sentence sentence reading and Dependency information determines the basic frequency, s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An audio synthesis device capable of embedding additional information which cannot be modified into a synthesis audio without causing audio quality deterioration or band limit includes: a language processing unit (201) for generating synthesized audio generation information required for creating a synthesized audio according to a character string; a prosody generation unit (202) for generating an audio prosody information according to the synthesized audio generation information; and a waveform generation unit (203) for synthesizing audio according to the prosody information. The prosody generation unit (202) embeds code information as watermark information into the prosody information in the area of a predetermined time width not exceeding the phoneme length containing a phoneme boundary.

Description

technical field [0001] The present invention relates to a sound synthesis device, in particular to a sound synthesis device capable of embedding information. Background technique [0002] With the development of digital signal processing technology in the past, in order to prevent the illegal copying of audio data, especially music data, and protect copyright, a technology that uses phase modulation, echo signal or auditory masking technology to embed information that does not affect audiovisual (transparent) has been developed. Way. In these methods, information is embedded after creating audio data as content, and the information is read out by a playback device to ensure that only legitimate right holders can use the content. [0003] As for voice, there is not only voice data produced by a human voice but also voice data produced by so-called voice synthesis. With the remarkable progress of the so-called voice synthesis technology that synthesizes voice from text strin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L19/00G10L19/018
CPCG10L13/10
Inventor 加藤弓子釜井孝浩
Owner PANASONIC INTELLECTUAL PROPERTY CORP OF AMERICA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products