Tibetan TTVS system realization method

A realization method for a Tibetan TTVS system, applied in speech analysis, speech recognition, instruments, etc., that addresses the difficulty of obtaining Tibetan speech resources and Tibetan mouth shape parameters.

Status: Inactive; Publication Date: 2016-03-09
NORTHWEST NORMAL UNIVERSITY

AI Technical Summary

Problems solved by technology

[0004] The realization method of the Tibetan TTVS system provided by the present invention in order to solve the above-mentioned deficiencies i



Embodiment Construction

[0035] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings and embodiments, but the protection scope of the present invention should not be limited thereto.

[0036] Build a 3D face model using existing modeling techniques. Define the Tibetan mouth shapes from the pronunciation characteristics of Tibetan, describe them with FAP parameters, and determine the FAP parameter values of each mouth shape to form a Tibetan mouth shape library. Combine the mouth shape library with the 3D face model to form a 3D face model library, then synthesize the 3D face animation from the FAP parameter values, the 3D face model library, and the phoneme durations.
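The animation step in [0036] can be illustrated with a minimal Python sketch. It assumes the mouth shape library is a mapping from a mouth shape name to FAP values and that frames are produced by linear blending toward the next mouth shape. The names `MOUTH_SHAPES`, `Segment`, and `fap_frames`, the FAP values, and the blending rule are all hypothetical; the patent text shown here does not specify them.

```python
from dataclasses import dataclass

# Hypothetical mouth shape library: mouth shape name -> FAP values.
# The numbers are illustrative placeholders, not values from the patent.
MOUTH_SHAPES = {
    "sil": {"open_jaw": 0.0, "stretch_l_cornerlip": 0.0},
    "a": {"open_jaw": 400.0, "stretch_l_cornerlip": 50.0},
    "u": {"open_jaw": 100.0, "stretch_l_cornerlip": -120.0},
}


@dataclass
class Segment:
    mouth_shape: str   # key into MOUTH_SHAPES
    duration: float    # phoneme duration in seconds


def fap_frames(segments, fps=25):
    """Return one FAP dict per video frame.

    Each segment starts at its own mouth shape and blends linearly
    toward the next segment's mouth shape (an assumed rule); the last
    segment holds its shape.
    """
    frames = []
    elapsed = 0.0
    for i, seg in enumerate(segments):
        # Frame boundaries come from cumulative time to avoid drift.
        start = round(elapsed * fps)
        elapsed += seg.duration
        end = round(elapsed * fps)
        cur = MOUTH_SHAPES[seg.mouth_shape]
        if i + 1 < len(segments):
            nxt = MOUTH_SHAPES[segments[i + 1].mouth_shape]
        else:
            nxt = cur
        n = end - start
        for k in range(n):
            alpha = k / n
            frames.append(
                {name: (1 - alpha) * cur[name] + alpha * nxt[name] for name in cur}
            )
    return frames


segments = [Segment("sil", 0.2), Segment("a", 0.4), Segment("sil", 0.2)]
frames = fap_frames(segments)
print(len(frames), frames[5]["open_jaw"])
```

Because the frame count is derived from the phoneme durations, the same durations can later drive the audio side, which is what keeps the two streams aligned.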

[0037] Prepare a Tibetan corpus consisting of Tibetan audio files and Tibetan text files, extract acoustic parameters from the speech, train HMM models, and build an HMM model library; perform text analysis on the input Tibetan text to obtain phonem...
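As a rough illustration of what the text-analysis output might look like, the sketch below turns a phoneme sequence into simple context-dependent labels. The `left-center+right` label format follows a common convention in HMM-based synthesis and is an assumption here; the function name `context_labels`, the padding symbol, and the romanized phonemes are hypothetical, and a real label set would carry far more context (syllable, word, and phrase position).

```python
def context_labels(phonemes, pad="sil"):
    """Build one 'left-center+right' label per phoneme.

    The sequence is padded with a silence symbol on both sides so the
    first and last phonemes also get a full context.
    """
    padded = [pad] + list(phonemes) + [pad]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


# Placeholder phoneme sequence, not the output of a real Tibetan front end.
print(context_labels(["k", "a", "ng"]))
```

Labels of this kind are what allow HMMs to be selected per context rather than per isolated phoneme.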


Abstract

The invention provides a realization method for a Tibetan TTVS system. The method includes the following steps: defining 84 FDP (Facial Definition Parameter) feature points according to the MPEG-4 standard; establishing an initial 3D face model and applying texture mapping to it; describing Tibetan mouth shapes with the FAP (Facial Animation Parameter) parameters defined by the MPEG-4 standard, photographing the lips of a Tibetan speaker with a camera, and establishing a Tibetan mouth shape library; deriving a 3D face model from the initial 3D face model in combination with the FDP parameters and the Tibetan mouth shape library, and establishing a 3D face model library; training HMMs on a prepared Tibetan corpus and clustering them to obtain an HMM library; and, once a Tibetan text is input into the system, performing text analysis on it to obtain context-dependent labels and a phoneme sequence, generating acoustic parameters through a parameter generation algorithm, and finally synthesizing Tibetan speech through the STRAIGHT algorithm. The advantage of the method is that the 3D face animation and the synthesized Tibetan speech can be played in synchronization.
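One way to picture the synchronized playback is to derive both audio sample ranges and video frame ranges from a single phoneme-duration timeline. The sketch below does only that bookkeeping. The function name `sync_schedule`, the 16 kHz sample rate, and the 25 fps frame rate are assumptions, and the text shown here does not state how the system actually implements synchronization.

```python
def sync_schedule(durations, sample_rate=16000, fps=25):
    """Map each (phoneme, duration_seconds) pair to audio and video ranges.

    Both ranges are computed from the same cumulative time, so the
    animation and the synthesized speech share phoneme boundaries.
    """
    schedule = []
    elapsed = 0.0
    for phoneme, duration in durations:
        start = elapsed
        elapsed += duration
        schedule.append(
            {
                "phoneme": phoneme,
                "first_sample": round(start * sample_rate),
                "end_sample": round(elapsed * sample_rate),
                "first_frame": round(start * fps),
                "end_frame": round(elapsed * fps),
            }
        )
    return schedule


# Placeholder durations, not measured values.
for entry in sync_schedule([("k", 0.08), ("a", 0.2)]):
    print(entry)
```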

Description

Technical field

[0001] The invention relates to the technical field of visual text-to-speech conversion, in particular to a realization method of a Tibetan TTVS system.

Background

[0002] With the development of computer technology, text and audio information alone can no longer meet the needs of human-computer interaction, and visual information has become increasingly popular in human-computer interaction because it is intuitive, vivid, and friendly. Combining traditional text and sound information with visual information yields a direct conversion from text to visual speech, that is, a TTVS (Text to Visual Speech) system. The computer plays the speaker's face animation in synchronization with the speech, making the human-computer interface friendlier and more natural. After decades of development, TTVS technology has promoted the further development of human-computer interaction technology from the initial sequential playback of...


Application Information

IPC(8): G10L15/14, G10L15/18, G10L15/183, G10L15/25, G10L15/26
CPC: G10L15/144, G10L15/148, G10L15/1807, G10L15/183, G10L15/25, G10L15/26
Inventors: 杨鸿武, 张策, 陆晓燕, 郝东亮, 高海燕, 徐世鹏, 甘振业
Owner: NORTHWEST NORMAL UNIVERSITY