Tibetan-Chinese speech synthesis method and device

A speech synthesis, bilingual technology, applied in speech synthesis, speech analysis, speech recognition and other directions, can solve the problem of lack of large-scale

Inactive Publication Date: 2014-12-17
NORTHWEST NORMAL UNIVERSITY
View PDF6 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For Tibetan, which lacks speech resources, it is difficult to apply the above method to Mandarin-Tibetan multilingual speech synthesis due to the lack of large-scale Chinese-Tibetan bilingual speech corpus

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tibetan-Chinese speech synthesis method and device
  • Tibetan-Chinese speech synthesis method and device
  • Tibetan-Chinese speech synthesis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] The present invention proposes a method for Chinese-Tibetan bilingual speech synthesis, proposes a word-to-sound conversion algorithm oriented to the Tibetan phonetic symbol SAMPA-T, and realizes the automatic labeling of the SAMPA-T of the Tibetan text corpus, according to Tibetan and The similarity between Mandarin, designed the common Mandarin and Tibetan phonetic transcription system, annotation format and question set, using the corpus of multiple Mandarin and Tibetan speakers, through HMM-based speaker adaptive training and speaker Adaptive transformation algorithm to finally synthesize Chinese or Tibetan speech. Chinese-Tibetan bilingual speech synthesis method flow chart of the present invention is as figure 1 As shown, the specific steps are:

[0079] (1) Design the SAMPA-T tagging scheme for the Tibetan Lhasa dialect, and use the SAMPA-T-oriented word-to-sound conversion algorithm to complete the SAMPA-T automatic tagging of the Tibetan text corpus.

[0080]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a Tibetan-Chinese speech synthesis method and device and aims to synthesize input Chinese or Tibetan statements to be synthesized, through a Chinese-Tibetan hybrid corpus preliminarily established. Chinese or Tibetan speeches can be synthesized at the same time by the method and the device. Compared with a traditional HMM-based (hidden Markov model based) speech synthesis system, the method and the device have the advantages that a speaker adaptive training process is added to a training phase to acquire a Chinese-Tibetan hybrid speech average model, the influence caused by speaker differences in a speech library can be reduced through the speaker adaptive training process, and quality of synthesized speeches is improved; on the basis of the average model, and a Tibetan or Chinese speech excellent in both naturalness and fluency can be obtained through synthesis of few Tibetan or Chinese corpus data through a speaker adaptive conversion algorithm; the research is significant to promoting the development of communication with minorities and the development of minority speech technologies.

Description

technical field [0001] The present invention relates to the technical field of multilingual speech synthesis, and more specifically, provides a Chinese-Tibetan cross-language bilingual speech synthesis method and device. Background technique [0002] In recent years, multilingual speech synthesis technology has become a research hotspot in the field of human-computer speech interaction. Using this technology can realize human-computer voice interaction in different languages ​​in the same system, which has important application value for countries or regions that speak several languages. China is a country with a large number of minority languages ​​and dialects, which makes the research of this technology important. For example, in Tibetan areas in my country, Mandarin, Tibetan and dialects are mainly spoken. A variety of speech synthesis will be of great significance to promoting communication with ethnic minorities and promoting the development of ethnic minority voice te...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L15/14
Inventor 杨鸿武王海燕徐世鹏裴东甘振业
Owner NORTHWEST NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products