Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Uygur language phoneme-viseme parameter conversion method and system

A Uyghur language and conversion method technology, applied in speech analysis, instrumentation, etc., can solve problems such as complex parameter estimation, reduced mouth shape parameter curve fitting accuracy, and inability to accurately describe the dynamic change process of mouth shape in continuous speech flow.

Active Publication Date: 2017-01-11
XINJIANG UNIVERSITY
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the direct use of vowel and consonant monophones as the basic dynamic viseme set cannot accurately describe the dynamic change process of mouth shapes in continuous speech
The use of multi-phonemes as dynamic visemes will increase the number of basic visemes, expand the parameter scale of the dynamic viseme model, make parameter estimation extremely complex, reduce the fitting accuracy of the lip-shape parameter curve, and cause inaccurate description The dynamic change process of the actual mouth shape, the lip synthesis effect is distorted

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Uygur language phoneme-viseme parameter conversion method and system
  • Uygur language phoneme-viseme parameter conversion method and system
  • Uygur language phoneme-viseme parameter conversion method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0181] The detailed content of the present invention and its specific implementation will be further described below in conjunction with the accompanying drawings.

[0182] figure 1 It is a schematic diagram of the Uighur phoneme-viseme conversion method of the present invention. As shown in the figure, the present invention first selects Uyghur syllables V, CV, VC, CVCC as the syllable sequence of the Uyghur basic viseme set, then carries out pronunciation recording and video recording of the selected Uyghur syllable sequence, and determines the Uyghur V, CV, VC static viseme extraction time. The determination method is as follows: for the vowel (V), take the central moment of the voice short-term energy curve as the extraction moment of the static viseme; for the consonant (C), because the same consonant is combined with different vowels, the mouth shape will be different. Therefore, CV and VC syllables were selected for cluster analysis of consonant static viseme respecti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a Uygur language phoneme-viseme parameter conversion method and system, and belongs to the technical field of voice-human face animation information processing. The method comprises the steps: adding 41 features and the visibility features of teeth and a tongue, carrying out the clustering of vowel mouth shape data, and obtaining a vowel basic static viseme set; respectively carrying out the clustering of consonants and mouth shape data combined with different vowels, and obtaining a consonant basic static viseme set; proposing a composite viseme concept based on the above, and building a Uygur language basic dynamic viseme set; giving a composite dynamic viseme model and a dynamic viseme model parameter estimation method based on a linear regression algorithm, thereby achieving the Uygur language phoneme-viseme conversion. According to the invention, the method carries out the text analysis of a to-be-converted Uygur language text according to the basic dynamic viseme set and the model parameters thereof, obtains a basic dynamic viseme sequence in the text, and can generate a human face and lip portion visual voice animation consistent with the content of the text.

Description

technical field [0001] The invention relates to the technical field of information conversion and processing between voice and human face dynamic information, in particular to a method and system for converting Uyghur phoneme-viseme parameters. Background technique [0002] A phoneme is the smallest unit of sound in the phonetic system that can distinguish the meaning of a word or morpheme. There are 32 phonemes in Uyghur, including 8 vowels and 24 consonants. Viseme refers to the physical shape of the mouth, tongue, jaw and other visible pronunciation organs corresponding to a certain phoneme. There are about dozens of phonemes in a language, and some phonemes have similar lip shapes, tongues, and teeth when they are pronounced. Therefore, there is a many-to-one phenomenon between phonemes and visemes. Viseme is the basis of facial lip animation and visual speech synthesis. The definition of the basic static viseme set is to combine phonemes corresponding to similar mout...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/10
CPCG10L21/10
Inventor 赵晖刘学杰秦添
Owner XINJIANG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products