Video synthesis method, device and equipment and storage medium

A video synthesis and video technology, applied in the field of video processing, can solve the problems of low synthesis efficiency and high difficulty of target video synthesis, and achieve the effect of reducing synthesis difficulty and improving synthesis efficiency.

Active Publication Date: 2020-10-02
TENCENT TECH (SHENZHEN) CO LTD
View PDF12 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the above technical solution, it is necessary to collect a large number of sample voices and sample images to train the machine learning model, and the synthesis of the target video is difficult and the synthesis efficiency is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video synthesis method, device and equipment and storage medium
  • Video synthesis method, device and equipment and storage medium
  • Video synthesis method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the purpose, technical solution and advantages of the present application clearer, the implementation manners of the present application will be further described in detail below in conjunction with the accompanying drawings.

[0037] First, the nouns involved in the embodiments of the present application are introduced.

[0038] Phoneme: refers to the smallest unit of speech that is divided according to the natural properties of speech, and is divided according to the pronunciation actions in a syllable, and a pronunciation action constitutes a phoneme. Phonemes include two types of vowels and consonants. For example, the syllable corresponding to the Chinese character "ah" is "a", which corresponds to one phoneme. For example, the syllable corresponding to the Chinese character "爱" is "ai", which corresponds to two phoneme, and so on, the Chinese character "dai" corresponds to three phonemes. It should be noted that a Chinese character corresponds to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a video synthesis method, device and equipment and a storage medium, and relates to the field of video processing. The method comprises the steps of obtaining a text; acquiringan audio corresponding to the text, and processing the audio to obtain n mouth shape identifiers corresponding to the phoneme sequence in the audio and mouth shape time point information of each mouth shape identifier, n being a positive integer; obtaining a standardized mouth shape sequence frame and a video containing an anchor image, wherein the standardized mouth shape sequence frame comprises mouth shape video frames corresponding to n mouth shape identifiers; and according to the mouth shape time point information of each mouth shape identifier, performing synthesis processing on the mouth shape video frames corresponding to the n mouth shape identifiers and the video containing the live streamer image to obtain a live streamer video. The live streamer video can be synthesized onlyby providing the text without pre-training a machine learning model for synthesizing the video, so that the video synthesis difficulty is reduced, and the video synthesis efficiency is improved.

Description

technical field [0001] The present application relates to the field of video processing, in particular to a video synthesis method, device, equipment and storage medium. Background technique [0002] Information is usually conveyed to the public intuitively in the form of recorded video, such as news broadcasts, conference hosting, legal science popularization, game commentary, etc. [0003] Taking news broadcasting as an example, in order to reduce the labor intensity of manual video recording, the machine learning model after deep learning is used to fuse the target voice sequence containing the news broadcast voice and the face image sequence containing the news anchor to obtain the news broadcast video. [0004] In the above technical solution, it is necessary to collect a large number of sample voices and sample images to train the machine learning model, and the synthesis of the target video is difficult and the synthesis efficiency is low. Contents of the invention ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04N21/233H04N21/234H04N21/2187H04N5/262
CPCH04N21/234H04N21/233H04N21/23424H04N21/23406H04N21/2187H04N5/262
Inventor 董霙刘炳楠
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products