Video description text generation method and device, equipment and storage medium

A video description and text technology, applied in the computer field, can solve the problems of low description text generation accuracy, unable to meet the needs of short video description text generation, etc., and achieve the effect of improving the generation accuracy

Pending Publication Date: 2022-07-29
BEIJING DAJIA INTERNET INFORMATION TECH CO LTD +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present disclosure provides a video description text generation method, device, equipment, and storage medium to at least solve the problem in the related art that the generation accuracy of the description text is low and cannot meet the generation requirements of the short video description text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video description text generation method and device, equipment and storage medium
  • Video description text generation method and device, equipment and storage medium
  • Video description text generation method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0082] In order to make those skilled in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.

[0083] It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used may be interchanged under appropriate circumstances so that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a video description text generation method and device, equipment and a storage medium. The method comprises the steps that at least two kinds of modal data corresponding to video data to be processed are acquired; inputting the at least two types of modal data into a modal association network to obtain modal association results corresponding to the at least two types of modal data; the modal association result represents association degrees between at least two types of modal data and theme contents of the to-be-processed video data; filtering the at least two types of modal data based on the modal association result to obtain filtered modal data; inputting the filtered modal data into a description text generation network to obtain a description text of the to-be-processed video data; the description text is used for describing the theme content. According to the embodiment of the invention, the interference of noise of some modal data can be eliminated, so that the description text better fitting the theme content of the to-be-processed video data can be generated, and the generation precision of the description text is improved.

Description

technical field [0001] The present disclosure relates to the field of computer technologies, and in particular, to a method, apparatus, device, and storage medium for generating video description text. Background technique [0002] In the related art, given multi-modal data (including visual modality, sound modality, text modality, etc.) are usually aligned and fused to generate a detailed description text of the video content. [0003] However, related technologies are usually suitable for data with small differences in modal information between samples, but the diversity of short video data samples is quite different. Using the solutions in related technologies makes the generation accuracy of the description text low, so it cannot meet the requirements of short video data. Video description text generation requirements. SUMMARY OF THE INVENTION [0004] The present disclosure provides a method, device, device and storage medium for generating video description text, so...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/738G06F16/78G06N3/04G06N3/08
CPCG06F16/739G06F16/7867G06N3/08G06N3/045
Inventor 贾梦朝聂礼强杨浩哲尉寅伟吴建龙张博威戴蒙
Owner BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products