Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice-driven face animation generation method and device, equipment and storage medium

A face and voice technology, which is applied in the field of equipment, storage media, devices, and voice-driven face animation generation methods, and can solve the problems of low generalization of face animation synthesis methods

Pending Publication Date: 2022-07-22
TSINGHUA UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] This application provides a voice-driven face animation generation method, device, equipment and storage medium to solve the low generalization problem of the existing voice-driven face animation synthesis method, by proposing a dynamic face animation based on few-sample learning Radiation field to more accurately model dynamic faces, and realize few-sample learning through the reference image mechanism to improve model generalization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice-driven face animation generation method and device, equipment and storage medium
  • Voice-driven face animation generation method and device, equipment and storage medium
  • Voice-driven face animation generation method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052]The following describes in detail the embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to be used to explain the present application, but should not be construed as a limitation to the present application.

[0053] The following describes the voice-driven facial animation generation method, apparatus, device, and storage medium of the embodiments of the present application with reference to the accompanying drawings. In view of the low generalization problem of the existing voice-driven face animation synthesis method mentioned by the above-mentioned background technology center, the present application provides a voice-driven face animation generation method, in which...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of computer vision, in particular to a voice-driven face animation generation method, device and equipment and a storage medium, and the method comprises the steps: extracting image features corresponding to any query view angle based on any query view angle and a plurality of reference images; extracting initial audio features of the audio frame by frame, and performing time sequence filtering on the audio features to obtain audio features meeting an inter-frame smoothing condition; and driving a dynamic human face nerve radiation field by using the image features and the audio features, and obtaining a generated image of the current frame after voxel rendering. Therefore, the problem of low generalization of an existing voice-driven face animation synthesis method is solved, a dynamic face is modeled more accurately by proposing a dynamic face radiation field based on few-sample learning, few-sample learning is realized through a reference image mechanism, and model generalization is improved.

Description

technical field [0001] The present application relates to the technical field of computer vision, and in particular, to a voice-driven facial animation generation method, apparatus, device, and storage medium. Background technique [0002] Voice-driven face animation synthesis uses a speech audio as a driving signal to control the mouth shape, and generates a target face video that matches the given audio. This emerging technology has a wide range of application scenarios, such as movie dubbing, video conferencing, online education, and virtual avatars. Although a large amount of related research has emerged recently, how to generate natural and realistic voice-driven facial animation videos still presents considerable challenges. [0003] At present, voice-driven facial animation synthesis methods can be roughly divided into 2D-based methods and 3D-based methods. Among them, 2D-based methods usually rely on Generative Adversarial Networks (GAN), however, due to the lack o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T13/40G06T15/00
CPCG06T13/40G06T15/005
Inventor 鲁继文周杰沈帅李万华朱政
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products