Weak supervision fine-grained image recognition method based on visual self-attention mechanism

An image recognition and attention technology, applied in the field of computer vision, can solve problems such as local optimal solutions and complex training methods

Active Publication Date: 2020-08-14
SOUTHEAST UNIV
View PDF3 Cites 44 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This training method is more complicated, and it is easy to fall into a local optimal solution.
[0005] In summary, for weakly supervised fine-grained image recognition tasks that only use image-level label information, existing methods are difficult to detect image discriminative regions and learn fine-grained features simply, accurately and stably, so an adaptive A fine-grained image recognition method based on visual self-attention mechanism with high robustness

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Weak supervision fine-grained image recognition method based on visual self-attention mechanism
  • Weak supervision fine-grained image recognition method based on visual self-attention mechanism
  • Weak supervision fine-grained image recognition method based on visual self-attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] A weakly supervised fine-grained image recognition method based on visual self-attention mechanism, comprising the following steps:

[0045] Step 1: In the preprocessing stage, the original image of any size is scaled to 600 × 600 pixels, and on this basis, a 448 × 448 pixel area is cropped with the center of the image as the origin, according to the mean [0.485, 0.456, 0.406] and standard deviation [0.229, 0.224, 0.225] normalize the cropped region, and then input the normalized image into a fine-grained recognition model based on the visual self-attention mechanism;

[0046] Step 2: The input image outputs a 14×14×2048-dimensional feature tensor through a shared convolutional neural network. The student-model uses the anchor frame idea of ​​the region proposal network RPN commonly used in the target detection field to set the step size to 1, 2, 2, Three 3×3 convolutional layers with 128 output channels are sequentially connected to the shared base network to reduce th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a weak supervision fine-grained image recognition method based on a visual self-attention mechanism. The method involves a student-model module, a teacher-model module and a classification-model module. The student-model and the teacher-model are combined through a Teacher-Student loop feedback mechanism based on Pairwise Approach sorting learning so as to form a self-attention region recommendation network, so that the relation between discriminative region positioning and fine-grained feature learning is enhanced, the discriminative region in the fine-grained image can still be accurately detected under the condition of lacking a target bounding box and a part marking point, and the recognition accuracy is promoted to be remarkably improved; meanwhile, a convolution layer is shared by the three modules, namely, the student-model, the teacher-model and the class-model, so that the model storage space is effectively compressed, the calculation cost is reduced, the method meets the real-time recognition task requirement, and the method is suitable for a large-scale real scene; and besides, a dynamic weight distribution mechanism is adopted in multi-task jointlearning to reduce the amount of artificially set hyper-parameters and enhance the robustness of the model, finally, the whole model is trained and learned in an end-to-end single-stage mode, and thenetwork optimization difficulty is reduced.

Description

technical field [0001] The invention relates to the technical field of computer vision, in particular to a weakly supervised fine-grained image recognition method based on a visual self-attention mechanism. Background technique [0002] Fine-grained image recognition is a challenging research topic in the field of computer vision, which aims to distinguish different sub-categories of the same general category. Compared with the cross-species coarse-grained image recognition task, the appearance similarity of different kinds of objects in the fine-grained image dataset is higher, and the same kind of objects have significant visual differences due to factors such as illumination, pose, and perspective, so the general depth is used. Learning image recognition techniques has difficulty in accurately discriminating fine-grained object categories. Previous studies have shown that the difficulties of fine-grained image recognition tasks lie in discriminative region localization a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06K9/46G06N3/04G06N3/08
CPCG06N3/084G06V10/464G06N3/045G06F18/23G06F18/24
Inventor 李春国刘杨杨哲杨绿溪徐琴珍
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products