Video description generation method based on multi-concept knowledge mining and storage medium
A video description and knowledge mining technology, applied in character and pattern recognition, instruments, computing, and related fields. It addresses the problem of descriptions that do not cover all of the content of the video, and achieves fast training, fast convergence, and improved quality.
Embodiment Construction
[0035] The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments. This embodiment is implemented on the premise of the technical solution of the present invention, and provides a detailed implementation manner and a specific operation process, but the protection scope of the present invention is not limited to the following embodiments.
[0036] This embodiment provides a method for generating video descriptions based on multi-concept knowledge mining, including: acquiring an input video to be processed; extracting visual features and semantic labels from the input video; optimizing the semantic labels to obtain prior semantic labels; and using the extracted visual features and the prior semantic labels as the input of a Transformer-based video description generation model to obtain the corresponding description result, wherein the visual features include 2D features and 3D ...
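The data flow described in this paragraph can be illustrated with a minimal sketch. It covers only how the model input might be assembled. The excerpt does not specify how the 2D and 3D features are combined or how the semantic labels are optimized, so the per-step concatenation in `fuse_visual_features` and the score threshold plus top-k rule in `select_prior_labels` are assumptions made for illustration. All function names, parameters, and the `ModelInput` type are hypothetical. The Transformer-based generation model itself is not shown.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass
class ModelInput:
    """Hypothetical container for what the description model consumes."""
    visual: List[List[float]]   # one fused feature vector per sampled time step
    prior_labels: List[str]     # semantic labels kept after the selection step


def fuse_visual_features(
    feats_2d: Sequence[Sequence[float]],
    feats_3d: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Concatenate 2D and 3D features per time step (an assumed fusion rule)."""
    if len(feats_2d) != len(feats_3d):
        raise ValueError("2D and 3D feature sequences must have equal length")
    return [list(a) + list(b) for a, b in zip(feats_2d, feats_3d)]


def select_prior_labels(
    label_scores: Dict[str, float],
    top_k: int = 3,
    min_score: float = 0.5,
) -> List[str]:
    """Stand-in for the label optimization step, which the source does not detail.

    Keeps labels scoring at least min_score, then returns the top_k of them,
    ordered by descending score with ties broken alphabetically.
    """
    kept = [(label, s) for label, s in label_scores.items() if s >= min_score]
    kept.sort(key=lambda pair: (-pair[1], pair[0]))
    return [label for label, _ in kept[:top_k]]


def build_model_input(
    feats_2d: Sequence[Sequence[float]],
    feats_3d: Sequence[Sequence[float]],
    label_scores: Dict[str, float],
    top_k: int = 3,
    min_score: float = 0.5,
) -> ModelInput:
    """Assemble visual features and prior semantic labels into one input."""
    return ModelInput(
        visual=fuse_visual_features(feats_2d, feats_3d),
        prior_labels=select_prior_labels(label_scores, top_k, min_score),
    )


# Toy values, not real extracted features.
example = build_model_input(
    feats_2d=[[0.1, 0.2], [0.3, 0.4]],
    feats_3d=[[1.0], [2.0]],
    label_scores={"dog": 0.9, "run": 0.7, "car": 0.2, "grass": 0.6},
    top_k=2,
)
print(example.prior_labels)  # ['dog', 'run']
```

In this sketch the visual stream and the label stream stay in separate fields, so a downstream model could embed and attend over each one independently.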