Text Image Multimodal Retrieval Method Based on Deep Topic Model
A topic model and text image technology, which is applied in still image data retrieval, unstructured text data retrieval, still image data query, etc., can solve the problem of limited expression ability, affecting retrieval performance, and inability to accurately describe the deep connection of different modal features and other problems, to achieve the effect of accurate association description, good retrieval performance, and improved retrieval accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0035] The present invention will be described in further detail below in conjunction with the accompanying drawings.
[0036] Refer to attached figure 1 The steps of the present invention are further described in detail.
[0037] Step 1. Preprocess the training data and test data.
[0038] Randomly select 25,000 labeled data from the MIR Flicker dataset in the form of text-image pairs, and use 15,000 of them as training data and 10,000 of them as test data.
[0039] Count the number of repeated words in the text data of the training data and the test data, sort them in descending order, and take the first 2000 words as the vocabulary. For each text, count the number of words that appear in the vocabulary and store them in a vector, and the value on each dimension of the vector represents the number of times the word appears in the document.
[0040] Extract the features of each image to form an image feature matrix with the feature dimension as the number of rows and the t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


