Voice synchronous-drive three-dimensional face mouth shape and face posture animation method

A voice-synchronized driving and facial posture technology, applied in animation production, computer parts, image data processing, etc. It addresses problems such as the difficulty of driving facial posture from speech and the weak correlation between the two, and achieves the effect of reducing intelligibility and recognizability

Inactive Publication Date: 2013-07-24

SOUTHWEST JIAOTONG UNIV

5 Cites, 70 Cited by


Problems solved by technology

Although there is an obvious correlation between mouth shape and speech, the correlation between facial posture and speech is relatively weak, so it is relatively difficult to drive accurate facial posture from speech.


Embodiment

[0030] The present invention is further described below with reference to the accompanying drawings and a specific embodiment:

[0031] The specific implementation method of the present invention roughly comprises the following steps:

[0032] 1. Viseme classification. Because the mouth shapes corresponding to some consonants and finals are similar, the present invention groups those consonants and finals into visemes according to their mouth shapes in order to reduce the amount of computation. There are 16 classes, F0 to F15; the specific classes are shown in Figure 1.
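The classification in step 1 amounts to a many-to-one lookup from phonetic units to viseme classes. A minimal Python sketch follows. The groupings are hypothetical placeholders, because the actual 16 classes are defined in Figure 1, which is not reproduced here; the names `VISEME_CLASSES`, `build_lookup`, and `to_visemes`, and the choice of `F0` as the fallback class, are likewise assumptions made for illustration.

```python
# Hypothetical viseme table. The real F0..F15 grouping is given in Figure 1 of
# the patent; only three illustrative classes are filled in here.
VISEME_CLASSES = {
    "F1": ["b", "p", "m"],       # assumed: lips closed
    "F2": ["f"],                 # assumed: lower lip against upper teeth
    "F3": ["d", "t", "n", "l"],  # assumed: tongue tip raised
}


def build_lookup(classes):
    """Invert {class: [units]} into {unit: class}, rejecting duplicates."""
    lookup = {}
    for label, units in classes.items():
        for unit in units:
            if unit in lookup:
                raise ValueError(f"{unit!r} is assigned to two viseme classes")
            lookup[unit] = label
    return lookup


PHONE_TO_VISEME = build_lookup(VISEME_CLASSES)


def to_visemes(units, default="F0"):
    """Map a sequence of phonetic units to viseme labels.

    Units missing from the table fall back to `default` (an assumption;
    the source does not say how unlisted units are handled).
    """
    return [PHONE_TO_VISEME.get(unit, default) for unit in units]


print(to_visemes(["b", "a", "m"]))
```

Collapsing many units into 16 labels means later modeling stages only need to distinguish 16 targets, which is the computational saving the step describes.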

[0033] 2. Build the audio/video corpus, recorded with a high-definition video camera. Twenty speakers, 10 men and 10 women, read the consonants classified in step 1 while audio and video are recorded at the same time, so that facial video synchronized with the voice is captured. In order to facilitate the retrieval and extraction of ...


Abstract

The invention discloses a voice synchronous-drive three-dimensional face mouth shape and face posture animation method. A user inputs new voice information, which is preprocessed so that mouth shape animation and face posture animation synchronized with the voice are synthesized on the head of a virtual human. The method comprises two stages. In the training stage, voice visualization modeling is achieved with a mixed model that combines the k-nearest neighbor algorithm (KNN) and a hidden Markov model (HMM). In the synthesis stage, the user inputs new voice information, features of the voice signal are extracted, the face posture and mouth shape sequence parameters corresponding to the voice signal are generated by the KNN and HMM mixed model and then given transition processing, and the Xface open-source software is used to synthesize detailed and rich three-dimensional face animation. The method has significant theoretical research value and broad application prospects in visual communication, virtual meetings, games, entertainment, teaching assistance and similar fields.
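The abstract names a KNN and HMM mixed model but does not say how the two parts are coupled. The Python sketch below shows one plausible arrangement, offered purely as an assumption: KNN votes over training frames give per-frame scores for each viseme class, and Viterbi decoding with a "sticky" transition model picks the label sequence. The function names, the `stay_prob` and `floor` parameters, and the toy two-dimensional features are all hypothetical and are not taken from the patent. It uses `math.dist`, so it needs Python 3.8 or later.

```python
import math
from collections import Counter


def knn_scores(feature, training, labels, k=3):
    """Score each label by its share of the k nearest training frames."""
    nearest = sorted(training, key=lambda item: math.dist(feature, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return {label: votes.get(label, 0) / k for label in labels}


def viterbi(emissions, labels, stay_prob=0.8, floor=1e-6):
    """Most likely label sequence for a list of per-frame score dicts.

    Assumed transition model: keep the current label with `stay_prob`,
    otherwise switch uniformly. Needs at least two labels. Scores are
    floored so that a zero vote does not produce log(0).
    """
    switch_prob = (1.0 - stay_prob) / (len(labels) - 1)

    def log_trans(a, b):
        return math.log(stay_prob if a == b else switch_prob)

    best = {s: math.log(max(emissions[0][s], floor)) for s in labels}
    back = []
    for frame in emissions[1:]:
        prev_best, best, pointers = best, {}, {}
        for s in labels:
            prev = max(labels, key=lambda p: prev_best[p] + log_trans(p, s))
            best[s] = (prev_best[prev] + log_trans(prev, s)
                       + math.log(max(frame[s], floor)))
            pointers[s] = prev
        back.append(pointers)

    # Trace back from the best final state.
    state = max(labels, key=lambda s: best[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]


# Toy data: two well-separated clusters standing in for speech features.
LABELS = ["F0", "F1"]
TRAINING = [((0.0, 0.0), "F0"), ((0.1, 0.0), "F0"), ((0.0, 0.1), "F0"),
            ((1.0, 1.0), "F1"), ((0.9, 1.0), "F1"), ((1.0, 0.9), "F1")]
FRAMES = [(0.0, 0.05), (0.05, 0.0), (0.95, 1.0), (1.0, 0.95)]

scores = [knn_scores(f, TRAINING, LABELS) for f in FRAMES]
print(viterbi(scores, LABELS))  # ['F0', 'F0', 'F1', 'F1']
```

In this arrangement the transition model discourages single-frame label flips: a frame whose votes lean only slightly toward another class stays with its neighbors, while a frame with unanimous votes still switches.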

Description

Technical field

[0001] The invention relates to the technical field of voice-driven three-dimensional facial animation synthesis, in particular to voice visualization coarticulation modeling based on a KNN and HMM hybrid model.

Background technique

[0002] Research on voice-driven 3D facial animation synthesis is an important topic in the field of natural human-computer interaction. Voice-driven 3D facial animation synthesis preprocesses a person's voice so that lip animation and facial expressions corresponding to the voice can be synthesized on a virtual 3D head. At present, research in this area focuses mainly on synthesizing synchronized, precise lip animation and on classifying facial expressions through speech analysis. There is not yet a good way for voice to drive a virtual human's lip animation and facial posture (facial gestures or visual prosody) at the same time. The so-called facial posture refers to non-verb...


Application Information


IPC(8): G06T13/40, G06K9/62

Inventors: 侯进, 米辉辉

Owner: SOUTHWEST JIAOTONG UNIV
