Multi-scale feature fusion image hash retrieval method and system based on deep learning
A multi-scale feature, fusion image technology, applied in still image data retrieval, still image data indexing, still image data clustering/classification, etc., can solve problems such as increased difficulty, avoid information loss, ensure consistency, The effect of increasing the cross-entropy loss
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0036] Such as figure 1 As shown, the multi-scale feature fusion image hash retrieval method based on deep learning of the present invention specifically includes the following steps:
[0037] S1. Training the deep network: Input the training data into the built deep network, and obtain the corresponding hash code and hash function through the backpropagation algorithm.
[0038] Training the deep network includes the convolutional network part in the front section, the multi-scale feature fusion part and the loss function part after the convolutional layer.
[0039] In the convolutional network part, in order to learn image features, the input image is convolved, pooled and activated, and input to the last convolutional layer.
specific example
[0040] The multi-scale feature fusion part after the convolutional layer solves the inconsistency of the input image size, copies the output of the convolutional layer of multiple channels, and divides the area, and performs the maximum pooling operation on the divided area. After pooling, the The data corresponding to the same division scale are added together to obtain the feature vector, and the obtained feature vectors are spliced together and input to the fully connected layer for learning of the hash layer. Among them, a deep convolutional neural network structure with multi-scale feature fusion can be constructed. Specifically, a ResNet network with strong learning ability can be used. The specific example is as follows: for the output of the convolutional layer of d channels, we copy it into n copies, and for each copy, we divide it into regions. The region division can be set according to specific needs, for example, the first One area is divided into one area, the ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

