An Image Description Method Based on Conditional Random Fields and Intrinsic Semantic Attention
An image description technique based on conditional random fields, applied in the field of semantic description. It addresses image descriptions that lack variety, inaccurate results, and time-consuming processing, and it aims to ensure accuracy and strengthen the connection effect.
Examples
Embodiment
[0071] This embodiment describes training and applying the method on the MS COCO dataset.
[0072] An image description method based on conditional random fields and internal semantic attention, as shown in Figure 1, includes the following steps:
[0073] Step 1: Process the training data. The MS COCO image description dataset is used as the training set. All descriptions in the dataset are preprocessed: each description is converted to lowercase, and the number of occurrences of each word is counted. Words that appear more than 5 times are saved into the dictionary; words that appear less than 5 times, as well as blank positions, are replaced with "UNK". This yields the final dictionary Vocab. In addition, for images in the dataset that have fewer than 5 reference descriptions, the existing reference descriptions are randomly copied so that each image has at least 5 corresponding descriptions. Then use the spaCy method to extr...
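The dictionary construction and caption padding in Step 1 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function names (`build_vocab`, `encode_caption`, `pad_captions`), the whitespace tokenization, and the input layout (a mapping from image id to a list of caption strings) are all assumptions. The text does not say how a word that appears exactly 5 times is treated; the sketch reads "more than 5 times" strictly, so such a word maps to "UNK". The handling of "blank positions" is not specified in enough detail to sketch and is omitted.

```python
import random
from collections import Counter

UNK = "UNK"
MIN_COUNT = 5       # frequency threshold stated in the text
MIN_CAPTIONS = 5    # minimum number of reference descriptions per image


def build_vocab(captions_per_image, min_count=MIN_COUNT):
    """Count lowercase words over all captions and keep the frequent ones.

    Assumption: captions are tokenized by whitespace, and a word is kept
    only if it appears strictly more than `min_count` times.
    """
    counts = Counter(
        word
        for captions in captions_per_image.values()
        for caption in captions
        for word in caption.lower().split()
    )
    return {word for word, count in counts.items() if count > min_count}


def encode_caption(caption, vocab):
    """Lowercase a caption and replace out-of-dictionary words with UNK."""
    return [w if w in vocab else UNK for w in caption.lower().split()]


def pad_captions(captions, min_captions=MIN_CAPTIONS, rng=None):
    """Randomly copy existing captions until there are at least `min_captions`."""
    if not captions:
        raise ValueError("an image needs at least one reference description")
    rng = rng or random
    padded = list(captions)
    while len(padded) < min_captions:
        padded.append(rng.choice(captions))
    return padded
```

Passing a seeded `random.Random` instance as `rng` makes the copying reproducible, which is convenient when the padded training set has to be regenerated.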


