Cross-modal hash retrieval method based on triple deep networks
A deep network and triplet technology, applied in the field of computer vision, can solve the problem of low retrieval accuracy, and achieve the effect of improving accuracy, enriching semantic information, and increasing discriminativeness.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0036] Below in conjunction with accompanying drawing and specific embodiment, the present invention is described in further detail,
[0037] refer to figure 1 , the present invention comprises the following steps:
[0038] Step 1) Preprocess the data:
[0039] Determine the data of two modalities: image data and text data, use the word2vec method to extract the Bag-of-words feature of the text data, express the text into a vector form for computer processing, and extract the original pixel features of the image data to retain the original information of the image; And 80% of the image data are used as image training data, and the rest are used as image query data; the text data corresponding to the image training data is used as text training data, and the rest are used as text query data;
[0040] Step 2) Get the hash codes of image training data and text training data:
[0041] Input the Bag-of-words feature of the text training data into the text deep network to obtain ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com