Method and system for realizing video and audio driven face animation combined with modal particle features

A technology for driving facial animation, and a system that implements it, applicable to animation production, speech analysis, speech recognition and related fields. It addresses the problem that existing approaches do not take the characteristics of modal particles into account, and it produces more vivid facial expressions.
CN112614212B · Active · Publication Date: 2022-05-17 · SHANGHAI JIAOTONG UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
SHANGHAI JIAOTONG UNIV
Publication Date
2022-05-17


Abstract

A method and system for video- and audio-driven face animation combined with modal particle features. Speech features are extracted to construct a speech feature matrix; a training network enhanced with modal particles applies multi-layer convolution operations to sample the feature matrix and map it to an intermediate variable in a low-dimensional space. The input speech is converted into text, modal particles are identified from the text content and encoded as a one-hot vector, and this vector is spliced with the intermediate variable to obtain an intermediate variable containing the modal particle features. After being mapped to the expression AU (action unit) parameters of the current frame, the result is fitted with the AU parameters generated by the video tracking and voice prediction algorithms and then used as the driving parameters of the face model to achieve expression enhancement. In the present invention, by inputting video of the user's face and audio of the user's voice, the three-dimensional Avatar model in the virtual scene can be jointly driven and, on the basis of real-time driving, both the overall and the local facial animation become more realistic and vivid.
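The one-hot encoding, splicing and AU fitting steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the particle vocabulary, the extra "no particle" slot, the function names and the weighted-average fitting rule are all assumptions made for illustration, and the convolutional network that produces the intermediate variable is left out.

```python
import re
from typing import List, Sequence

# Hypothetical modal-particle vocabulary; the source does not list one.
MODAL_PARTICLES = ["ah", "oh", "wow", "hmm"]


def particle_one_hot(text: str, vocab: Sequence[str] = MODAL_PARTICLES) -> List[float]:
    """Encode the first modal particle found in the recognized text.

    Slot 0 means "no particle found" (an assumption); slot i + 1
    corresponds to vocab[i].
    """
    vec = [0.0] * (len(vocab) + 1)
    for token in re.findall(r"[a-z]+", text.lower()):
        if token in vocab:
            vec[vocab.index(token) + 1] = 1.0
            return vec
    vec[0] = 1.0
    return vec


def splice(intermediate: Sequence[float], one_hot: Sequence[float]) -> List[float]:
    """Concatenate the low-dimensional intermediate variable with the one-hot vector."""
    return list(intermediate) + list(one_hot)


def blend_au(video_au: Sequence[float], audio_au: Sequence[float],
             video_weight: float = 0.5) -> List[float]:
    """Combine two AU parameter vectors with a weighted average.

    The weighted average is an assumed stand-in for the fitting step,
    which the abstract does not specify.
    """
    if len(video_au) != len(audio_au):
        raise ValueError("AU vectors must have the same length")
    return [video_weight * v + (1.0 - video_weight) * a
            for v, a in zip(video_au, audio_au)]


if __name__ == "__main__":
    one_hot = particle_one_hot("Wow, that is great")
    features = splice([0.2, 0.7], one_hot)
    print(features)
    print(blend_au([1.0, 0.0], [0.0, 1.0], video_weight=0.25))
```

In a full system the spliced vector would feed the remaining network layers that output per-frame AU parameters; here only the data flow around that network is shown.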

Description

Technical field

[0001] The present invention relates to a technology in the field of computer graphics, in particular to a method and system for realizing video-audio-driven facial animation combined with features of modal particles.

Background technique

[0002] Existing approaches to facial expression animation include traditional interactive modeling and key-frame animation, motion capture based on facial marker tracking, driving methods based on video stream images, and driving methods based on audio prediction. Among these, interactive modeling and key-frame animation are widely used in games, 3D animation and other fields, and are the mainstream methods for producing high-precision 3D facial animation. This approach offers high precision and mature technology and is suitable for assembly-line production, but it requires lengthy setup and adjustment by modelers and animators, which is time-consuming and labor-intensive, and t...

Claims
