Social image retrieval method and system based on missing multi-modal hash
A social image, multi-modal technology, applied in still image data retrieval, metadata still image retrieval, special data processing applications, etc., can solve the problems of inaccurate and lack of retrieval.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0063] Such as figure 1 As shown, Embodiment 1 of the present disclosure provides a social image retrieval method based on unsupervised missing multimodal hashing, including:
[0064] S1: Obtain a multimodal retrieval dataset, where each sample includes paired image and text data of two modalities, and divide them into training set, test set and database set. Construct missing data sets for training set, test set and database set respectively;
[0065] This disclosure considers a social image dataset, including social image features and text features marked as labels Both image features and text features contain two parts: fully paired data features and missing data features. is n 1 social image features missing corresponding labels, is n 2 Text features of missing images, where d 1 and d 2 are the dimensions of image and text features, respectively. The goal of this example is to learn a shared hash code B ∈ [-1,1] n×r , where r represents the length of the hash...
Embodiment 2
[0136] Such as figure 2 As shown, Embodiment 2 of the present disclosure provides a social image retrieval method based on supervised missing multimodal hashing, including:
[0137] S1: Obtain multimodal retrieval datasets and construct missing datasets;
[0138] S2: Input the original data of the two modalities into the constructed deep feature extraction model to perform multimodal extraction on the training data set, and then map the extracted multimodal features to a low-dimensional space using the Gaussian kernel function;
[0139] S3: Use the pairwise semantic matrix to guide the projection learning process, and construct an objective function based on supervised missing multimodal hashing on this dataset;
[0140] Existing multimodal hashing methods mostly focus on unsupervised methods, while the development of supervised multimodal hashing methods is seriously lagging behind. Since supervised hashing utilizes discriminative label information to preserve the semantic...
Embodiment 3
[0184] Such as image 3 As shown, Embodiment 3 of the present disclosure provides a social image retrieval system based on missing multimodal hash, including:
[0185] The image preprocessing module is configured to: obtain a multimodal retrieval data set, wherein each sample includes paired image and text two modal data, and divide them into a training set, a test set and a database set. Construct missing data sets for training set, test set and database set respectively;
[0186] The nonlinear feature representation module is configured to: input the original data of the two modalities into the constructed deep feature extraction model to perform multimodal extraction on the training data set, and then use Gaussian The kernel function is mapped to a low-dimensional space;
[0187] The objective function construction module is configured to: for the training multimodal data set, construct the objective function f based on the unsupervised missing multimodal hash on the data...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com