Cross-modal hash retrieval method based on triple deep networks
A deep network and triplet technology, applied in the field of computer vision, can solve the problem of low retrieval accuracy, and achieve the effect of improving accuracy, enriching semantic information, and increasing discriminativeness.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0036] Below in conjunction with the accompanying drawings and specific embodiments, the present invention will be described in further detail,
[0037] refer to figure 1 , the present invention comprises the steps:
[0038] Step 1) Preprocess the data:
[0039] Determine the data of two modalities: image data and text data, use the word2vec method to extract the Bag-of-words feature of the text data to represent the text in a vector form for computer processing, and extract the original pixel features of the image data to retain the original information of the image; Take 80% of the image data as image training data, and the rest as image query data; take the text data corresponding to the image training data as text training data, and the rest as text query data;
[0040] Step 2) Obtain the hash codes of image training data and text training data:
[0041] Input the Bag-of-words features of the text training data into the text deep network to obtain the text training data...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap