Tibetan-Chinese speech synthesis method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech synthesis, bilingual technology, applied in speech synthesis, speech analysis, speech recognition and other directions, can solve the problem of lack of large-scale

Inactive Publication Date: 2014-12-17

NORTHWEST NORMAL UNIVERSITY

View PDF6 Cites 33 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

For Tibetan, which lacks speech resources, it is difficult to apply the above method to Mandarin-Tibetan multilingual speech synthesis due to the lack of large-scale Chinese-Tibetan bilingual speech corpus

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0078] The present invention proposes a method for Chinese-Tibetan bilingual speech synthesis, proposes a word-to-sound conversion algorithm oriented to the Tibetan phonetic symbol SAMPA-T, and realizes the automatic labeling of the SAMPA-T of the Tibetan text corpus, according to Tibetan and The similarity between Mandarin, designed the common Mandarin and Tibetan phonetic transcription system, annotation format and question set, using the corpus of multiple Mandarin and Tibetan speakers, through HMM-based speaker adaptive training and speaker Adaptive transformation algorithm to finally synthesize Chinese or Tibetan speech. Chinese-Tibetan bilingual speech synthesis method flow chart of the present invention is as figure 1 As shown, the specific steps are:

[0079] (1) Design the SAMPA-T tagging scheme for the Tibetan Lhasa dialect, and use the SAMPA-T-oriented word-to-sound conversion algorithm to complete the SAMPA-T automatic tagging of the Tibetan text corpus.

[0080]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a Tibetan-Chinese speech synthesis method and device and aims to synthesize input Chinese or Tibetan statements to be synthesized, through a Chinese-Tibetan hybrid corpus preliminarily established. Chinese or Tibetan speeches can be synthesized at the same time by the method and the device. Compared with a traditional HMM-based (hidden Markov model based) speech synthesis system, the method and the device have the advantages that a speaker adaptive training process is added to a training phase to acquire a Chinese-Tibetan hybrid speech average model, the influence caused by speaker differences in a speech library can be reduced through the speaker adaptive training process, and quality of synthesized speeches is improved; on the basis of the average model, and a Tibetan or Chinese speech excellent in both naturalness and fluency can be obtained through synthesis of few Tibetan or Chinese corpus data through a speaker adaptive conversion algorithm; the research is significant to promoting the development of communication with minorities and the development of minority speech technologies.

Description

technical field [0001] The present invention relates to the technical field of multilingual speech synthesis, and more specifically, provides a Chinese-Tibetan cross-language bilingual speech synthesis method and device. Background technique [0002] In recent years, multilingual speech synthesis technology has become a research hotspot in the field of human-computer speech interaction. Using this technology can realize human-computer voice interaction in different languages in the same system, which has important application value for countries or regions that speak several languages. China is a country with a large number of minority languages and dialects, which makes the research of this technology important. For example, in Tibetan areas in my country, Mandarin, Tibetan and dialects are mainly spoken. A variety of speech synthesis will be of great significance to promoting communication with ethnic minorities and promoting the development of ethnic minority voice te...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/02G10L15/14

Inventor 杨鸿武王海燕徐世鹏裴东甘振业

Owner NORTHWEST NORMAL UNIVERSITY

Tibetan-Chinese speech synthesis method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology