Full convolution video description generation method based on self-optimization mechanism
A video description and self-optimization technology, applied in the field of cross-media generation learning, can solve problems such as the difficulty of recurrent neural network training, the inability to parallelize recurrent neural networks, and the long gradient transmission path, so as to improve usability and user experience, and describe content in natural language Rich, Fast-Training Effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0046] The following is a detailed introduction to the fully convolutional video description generation method based on the self-optimization mechanism with reference to the accompanying drawings.
[0047] as attached figure 1 Shown, concrete steps of the present invention comprise:
[0048] Step 1. Collect the required video data from the multimedia data set, and obtain the video and the marked video description.
[0049] In step 1, there are usually multiple natural language descriptions corresponding to a video, and the tag words that are infrequent or useless in the entire data set are sorted out. The steps of sorting are as follows:
[0050] Step 1.1: Count the frequency of all words in the data set annotation in the data set;
[0051] Step 1.2: Filter out meaningless words with numbers in those words;
[0052] Step 1.3: For each image annotation, words that appear less frequently in the entire dataset are considered as relatively minor information in the image and del...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

