Video generation method, device, electronic device and storage medium

A technology for video and generative models, applied in the field of computer vision, which can solve problems such as stiffness, loss of face information, incomplete matching and synchronization of video voice and face images, etc.

Active Publication Date: 2022-03-25
合肥的卢深视科技有限公司
View PDF13 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, most of the above processing methods do not take into account the changes in the body movements of the characters, resulting in rigid and stiff videos.
In addition, due to various analysis processes on the face, these intermediate analysis processes have caused the loss of face information, so that the generated video voice and face image are not completely matched and synchronized.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video generation method, device, electronic device and storage medium
  • Video generation method, device, electronic device and storage medium
  • Video generation method, device, electronic device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, various implementation modes of the present invention will be described in detail below in conjunction with the accompanying drawings. However, those of ordinary skill in the art can understand that, in each implementation manner of the present invention, many technical details are provided for readers to better understand the present application. However, even without these technical details and various changes and modifications based on the following implementation modes, the technical solution claimed in this application can also be realized.

[0026] The implementation details of the video generation method in this embodiment are described below with an example. The following contents are only implementation details provided for easy understanding, and are not necessary for implementing this solution.

[0027] Embodiments of the present invention...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the present invention relates to the field of computer vision, and discloses a video generation method, device, electronic equipment, and storage medium. The video generation method of the present invention includes: extracting the audio feature of each frame of the target audio data according to the video frame rate corresponding to the pre-trained video generation model; acquiring the human body posture vector of the person in the target video data synchronized with the target audio data; Using the audio feature and the human body pose vector corresponding to the audio feature through a pre-trained video generation model to obtain a portrait video synchronized with the target audio data, wherein the portrait video includes multiple frames of character images, and the multiple The frame of person image contains the mapping relationship between the audio feature and the human body pose vector. Applied to the process of voice-driven video generation, the generated video voice and portrait are strictly matched and synchronized.

Description

technical field [0001] Embodiments of the present invention relate to the field of computer vision, and in particular, to a video generation method, device, electronic equipment, and storage medium. Background technique [0002] In fields such as artificial intelligence and computer vision, digital or virtual humans that simulate real-life prototypes have been used more and more. The generation of digital human or virtual human mainly utilizes voice-driven video generation technology to generate visual effects as realistic as the original video by estimating the expression, movement and speaking style of the human face at this moment. At present, voice-driven video generation is mostly achieved through processing methods such as reconstruction of 3D faces, efficient regression of expression coefficients, or 2D facial key points. [0003] However, most of the above processing methods do not take into account the changes in the body movements of the characters, resulting in r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04N21/233H04N21/234H04N21/242H04N21/43H04N21/439H04N21/44G06N3/04G06N3/08
CPCH04N21/233H04N21/23418H04N21/23424H04N21/242H04N21/43072H04N21/4394H04N21/44008H04N21/44016G06N3/08G06N3/045
Inventor 郭玉东石彪李廷照户磊
Owner 合肥的卢深视科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products