Chinese-Tibetan cross-language voice conversion method and system

A cross-language speech conversion technology, applied in speech synthesis, speech analysis, instruments, etc., that addresses the low naturalness and poor intelligibility of converted speech, improving naturalness and promoting rapid development of the technology.

Status: Inactive
Publication Date: 2016-11-16
NORTHWEST NORMAL UNIVERSITY
6 Cites, 16 Cited by

AI Technical Summary

Problems solved by technology

[0005] To address the low naturalness and poor intelligibility of cross-language speech conversion in the prior art, the present invention provides a Chinese-Tibetan bilingual cross-language speech conversion method and system. It focuses on prosody conversion for Tibetan-to-Chinese cross-language speech conversion and realizes cross-language conversion between Chinese and Tibetan. This will not only promote the rapid development of Tibetan speech information processing technology, but also play a vital role in promoting the exchange of speech technology between ethnic groups.

Examples

Embodiment Construction

[0044] The present invention provides a method for Chinese-Tibetan bilingual cross-language speech conversion. In the training stage, a five-degree tone model is used to build the tone model, which completes the prosody modeling. STRAIGHT is then used to modify the extracted prosody parameters, realizing prosody control in Chinese-Tibetan cross-language conversion and improving the naturalness of the output Chinese speech.
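As a rough illustration of how a five-degree tone description can be turned into a fundamental frequency target, the sketch below places tone degrees 1 to 5 on a log-frequency scale inside a speaker's F0 range. This is not the patent's model: the log-linear mapping, the function names `degree_to_f0`, `f0_to_degree` and `tone_contour`, and the use of the textbook Chao tone values for Standard Chinese (55, 35, 214, 51) are all assumptions made for the example.

```python
import math

# Textbook Chao five-degree values for the four Standard Chinese tones.
# Assumption for illustration; the patent text shown here does not list them.
MANDARIN_TONES = {1: (5, 5), 2: (3, 5), 3: (2, 1, 4), 4: (5, 1)}


def degree_to_f0(degree, f0_min, f0_max):
    """Map a tone degree in [1, 5] to Hz: degree 1 -> f0_min, degree 5 -> f0_max.

    The spacing is uniform in log frequency (an assumed convention).
    """
    return f0_min * (f0_max / f0_min) ** ((degree - 1) / 4.0)


def f0_to_degree(f0_hz, f0_min, f0_max):
    """Inverse of degree_to_f0: express an F0 value on the 1-5 degree scale."""
    return 1.0 + 4.0 * math.log(f0_hz / f0_min) / math.log(f0_max / f0_min)


def tone_contour(tone, f0_min, f0_max, n_frames):
    """Piecewise-linear F0 contour (Hz) through the degree targets of a tone.

    Requires n_frames >= 2. Interpolation is done on the degree scale,
    then each point is converted to Hz.
    """
    targets = MANDARIN_TONES[tone]
    contour = []
    for i in range(n_frames):
        pos = i * (len(targets) - 1) / (n_frames - 1)
        left = min(int(pos), len(targets) - 2)
        frac = pos - left
        degree = targets[left] + frac * (targets[left + 1] - targets[left])
        contour.append(degree_to_f0(degree, f0_min, f0_max))
    return contour
```

For example, `tone_contour(4, 100.0, 400.0, 5)` produces a falling contour from the top to the bottom of an assumed 100 to 400 Hz range. A real system would estimate the range per speaker from the training corpus.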

[0045] The technical solution in the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the present invention. Obviously, what is described is only a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0046] As shown in Figure 1, the inventio...

Abstract

The invention discloses a Chinese-Tibetan cross-language voice conversion method comprising the following steps: A, designing a corresponding text corpus and recording the voice corpus, then segmenting and labeling the voice corpus, classifying its units, and indexing it by catalogue, to build a Tibetan Lhasa dialect syllable database and a standard Chinese prosodic feature analysis database; B, building a fundamental frequency model with a five-degree tone model, and building a duration conversion model and a pause duration conversion model, to complete the prosody model; C, inputting a Tibetan text, selecting suitable syllables from the Tibetan Lhasa dialect syllable database with a decision tree, and completing the voice conversion by waveform splicing; D, modifying the prosody parameters (fundamental frequency, duration and pause duration) of the converted voice with the STRAIGHT algorithm to complete the prosody control, and outputting standard Chinese speech. The method realizes a Chinese-Tibetan cross-language voice conversion system based on prosody control.
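Step D can be pictured as editing frame-level parameter tracks before resynthesis. The sketch below shows only that bookkeeping on an F0 track: each syllable is stretched by a duration ratio and followed by a pause. It does not call STRAIGHT, and the 5 ms frame period, the dictionary keys `f0`, `ratio` and `pause_ms`, and the function names `resample_track` and `apply_prosody` are hypothetical choices made for the example.

```python
FRAME_MS = 5  # assumed analysis frame period; the source does not specify one


def resample_track(track, n_out):
    """Linearly resample a frame-level track (at least 2 values) to n_out frames."""
    out = []
    for i in range(n_out):
        pos = i * (len(track) - 1) / (n_out - 1)
        left = min(int(pos), len(track) - 2)
        frac = pos - left
        out.append(track[left] + frac * (track[left + 1] - track[left]))
    return out


def apply_prosody(syllables, frame_ms=FRAME_MS):
    """Build one F0 track from per-syllable tracks, duration ratios and pauses.

    Each syllable is a dict with hypothetical keys:
      'f0'       source F0 values in Hz, one per frame
      'ratio'    target duration divided by source duration
      'pause_ms' silence to insert after the syllable
    Pause frames are written as 0.0, a common marker for unvoiced frames.
    """
    result = []
    for syl in syllables:
        n_out = max(2, round(len(syl["f0"]) * syl["ratio"]))
        result.extend(resample_track(syl["f0"], n_out))
        result.extend([0.0] * round(syl["pause_ms"] / frame_ms))
    return result


if __name__ == "__main__":
    plan = [
        {"f0": [220.0, 230.0, 240.0], "ratio": 1.5, "pause_ms": 20},
        {"f0": [200.0, 180.0], "ratio": 2.0, "pause_ms": 0},
    ]
    print(len(apply_prosody(plan)), "frames")
```

In a full system the ratios and pause lengths would come from the duration and pause duration conversion models of step B, and the modified tracks would be passed to the vocoder for resynthesis.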

Description

Technical field

[0001] The invention belongs to the technical field of multilingual speech synthesis, and in particular relates to a Chinese-Tibetan bilingual cross-language speech conversion method and system.

Background

[0002] With the rapid development of science and technology, artificial intelligence has gradually entered people's lives. Voice conversion is an important part of artificial intelligence technology and an important research direction. It is a relatively new field that modifies the speech characteristics of a source speaker so that the speech takes on the characteristics of a target speaker. Same-language speech conversion means that the source speaker and the target speaker speak the same language, and cross-language speech conversion means that the source speaker and the targe...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/07, G10L13/08
CPC: G10L13/02, G10L13/07, G10L13/08
Inventor: 甘振业, 贾浩洁, 阮文彬, 杨鸿武, 余珊珊, 孔新杰
Owner: NORTHWEST NORMAL UNIVERSITY