Uygur language phoneme-viseme parameter conversion method and system

A Uyghur phoneme-viseme conversion technology, applied in speech analysis, instrumentation, etc. It addresses problems such as complex parameter estimation, reduced fitting accuracy of the mouth-shape parameter curve, and the inability to accurately describe how the mouth shape changes dynamically in continuous speech.

Active Publication Date: 2017-01-11

XINJIANG UNIVERSITY



AI Technical Summary

Problems solved by technology

Directly using vowel and consonant monophones as the basic dynamic viseme set cannot accurately describe how mouth shapes change dynamically in continuous speech.

Using multi-phoneme units as dynamic visemes increases the number of basic visemes and enlarges the parameter scale of the dynamic viseme model. Parameter estimation becomes extremely complex, the fitting accuracy of the lip-shape parameter curve drops, the actual dynamic change of the mouth shape is described inaccurately, and the synthesized lip animation is distorted.



Embodiment Construction

[0181] The detailed content of the present invention and its specific implementation will be further described below in conjunction with the accompanying drawings.

[0182] Figure 1 is a schematic diagram of the Uyghur phoneme-viseme conversion method of the present invention. As shown in the figure, the invention first selects the Uyghur syllables V, CV, VC, and CVCC as the syllable sequence for the Uyghur basic viseme set, then makes audio and video recordings of the selected syllable sequence and determines the extraction times of the static visemes for the Uyghur V, CV, and VC syllables. The extraction time is determined as follows. For a vowel (V), the central moment of the short-time energy curve of the speech is taken as the extraction moment of the static viseme. For a consonant (C), the mouth shape differs depending on which vowel the consonant is combined with. Therefore, CV and VC syllables were selected for cluster analysis of consonant static viseme respecti...
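The vowel rule in [0182] can be illustrated with a minimal Python sketch. It is not the patent's implementation: the function names, the frame length, the hop size, and the relative energy threshold are all assumptions, and "central moment" is interpreted here as the midpoint of the span in which the short-time energy stays above that threshold.

```python
def short_time_energy(samples, frame_len, hop):
    """Return the sum of squared samples for each analysis frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame))
    return energies


def vowel_extraction_time(samples, sample_rate,
                          frame_len=400, hop=160, rel_threshold=0.1):
    """Estimate the static-viseme extraction time (seconds) for a vowel.

    Assumption: the extraction moment is the midpoint between the first
    and last frame whose energy reaches rel_threshold * peak energy.
    All default parameter values are illustrative, not from the patent.
    """
    energies = short_time_energy(samples, frame_len, hop)
    if not energies:
        raise ValueError("signal is shorter than one frame")
    peak = max(energies)
    if peak == 0:
        raise ValueError("signal is silent")
    active = [i for i, e in enumerate(energies) if e >= rel_threshold * peak]
    mid_frame = (active[0] + active[-1]) / 2.0
    # Convert the frame index to the time of that frame's centre sample.
    return (mid_frame * hop + frame_len / 2.0) / sample_rate
```

The video frame closest to the returned time would then be the one from which the vowel's mouth-shape features are measured.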


Abstract

The invention relates to a Uygur language phoneme-viseme parameter conversion method and system, and belongs to the technical field of voice and facial animation information processing. The method comprises the following steps: adding 41 features together with visibility features of the teeth and tongue, clustering the vowel mouth-shape data, and obtaining a basic static viseme set for vowels; clustering the mouth-shape data of consonants combined with different vowels, and obtaining a basic static viseme set for consonants; on this basis, proposing the concept of a composite viseme and building a Uygur basic dynamic viseme set; and providing a composite dynamic viseme model and a linear-regression-based method for estimating the dynamic viseme model parameters, thereby achieving Uygur phoneme-viseme conversion. Using the basic dynamic viseme set and its model parameters, the method analyzes the Uygur text to be converted, obtains the basic dynamic viseme sequence of the text, and can generate a visual speech animation of the face and lips that is consistent with the text content.
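The clustering step in the abstract can be sketched as follows. The abstract does not name a clustering algorithm, so this example swaps in a plain k-means with a deterministic farthest-point initialisation purely for illustration. The function names and the short toy feature vectors are hypothetical; the actual method uses a much larger mouth-shape feature set.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cluster_mouth_shapes(features, k, iterations=20):
    """Group phoneme labels whose mouth-shape vectors are similar.

    features: dict mapping a phoneme label to its feature vector.
    Returns a list of sets; each set is one candidate static viseme class.
    k-means is an assumption here, not the algorithm named by the source.
    """
    labels = sorted(features)
    # Farthest-point initialisation keeps the result deterministic.
    centroids = [list(features[labels[0]])]
    while len(centroids) < k:
        farthest = max(
            labels,
            key=lambda p: min(squared_distance(features[p], c) for c in centroids),
        )
        centroids.append(list(features[farthest]))

    assignment = {}
    for _ in range(iterations):
        new_assignment = {
            p: min(range(k), key=lambda j: squared_distance(features[p], centroids[j]))
            for p in labels
        }
        if new_assignment == assignment:
            break  # converged: no label changed cluster
        assignment = new_assignment
        for j in range(k):
            members = [features[p] for p in labels if assignment[p] == j]
            if members:
                centroids[j] = mean_vector(members)

    groups = {}
    for p in labels:
        groups.setdefault(assignment[p], set()).add(p)
    return sorted(groups.values(), key=lambda g: sorted(g))
```

Each returned set corresponds to several phonemes that share one mouth shape, which is the many-to-one relation a basic static viseme set encodes.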

Description

Technical field

[0001] The invention relates to the technical field of information conversion and processing between voice and dynamic facial information, and in particular to a method and system for converting Uyghur phoneme-viseme parameters.

Background art

[0002] A phoneme is the smallest unit of sound in a phonetic system that can distinguish the meaning of a word or morpheme. Uyghur has 32 phonemes: 8 vowels and 24 consonants. A viseme is the physical shape of the mouth, tongue, jaw, and other visible articulators that corresponds to a given phoneme. A language has several dozen phonemes, and some of them are pronounced with similar lip, tongue, and teeth configurations, so the relation between phonemes and visemes is many-to-one. The viseme is the basis of facial lip animation and visual speech synthesis. The definition of the basic static viseme set is to combine phonemes corresponding to similar mout...
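The many-to-one relation between phonemes and visemes described in [0002] can be shown with a short Python sketch. The class names and phoneme groupings below are hypothetical placeholders chosen for illustration; they are not the Uyghur viseme set defined by the patent.

```python
def build_viseme_table(viseme_classes):
    """Invert {viseme: [phonemes]} into a phoneme -> viseme lookup.

    Raises ValueError if a phoneme is listed under two visemes, because
    the mapping must be many-to-one (each phoneme has exactly one viseme).
    """
    table = {}
    for viseme, phonemes in viseme_classes.items():
        for phoneme in phonemes:
            if phoneme in table:
                raise ValueError(f"phoneme {phoneme!r} assigned to two visemes")
            table[phoneme] = viseme
    return table


def phonemes_to_visemes(phonemes, table):
    """Map a phoneme sequence to its viseme sequence, one viseme per phoneme."""
    return [table[p] for p in phonemes]


# Hypothetical classes for illustration only.
EXAMPLE_CLASSES = {
    "bilabial_closed": ["p", "b", "m"],
    "open_vowel": ["a"],
    "rounded_vowel": ["o", "u"],
}
```

With these placeholder classes, the phoneme sequences `["b", "a"]` and `["m", "a"]` map to the same viseme sequence, which is why a viseme set is smaller than the phoneme inventory.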


Application Information


IPC(8): G10L21/10

CPC: G10L21/10

Inventors: 赵晖, 刘学杰, 秦添

Owner: XINJIANG UNIVERSITY
