Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A weakly supervised fine-grained image recognition method based on visual self-attention mechanism

An image recognition and attention technology, applied in the field of computer vision, can solve problems such as local optimal solutions and complex training methods, and achieve the effects of reducing computational costs, enhancing robustness, and reducing design burdens

Active Publication Date: 2022-04-08
SOUTHEAST UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This training method is more complicated, and it is easy to fall into a local optimal solution.
[0005] In summary, for weakly supervised fine-grained image recognition tasks that only use image-level label information, existing methods are difficult to detect image discriminative regions and learn fine-grained features simply, accurately and stably, so an adaptive A fine-grained image recognition method based on visual self-attention mechanism with high robustness

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A weakly supervised fine-grained image recognition method based on visual self-attention mechanism
  • A weakly supervised fine-grained image recognition method based on visual self-attention mechanism
  • A weakly supervised fine-grained image recognition method based on visual self-attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] A weakly supervised fine-grained image recognition method based on a visual self-attention mechanism, comprising the following steps:

[0045] Step 1: In the preprocessing stage, the original image of any size is scaled to 600×600 pixels, and on this basis, a 448×448 pixel area is cut out with the center of the image as the origin, according to the mean [0.485, 0.456, 0.406] and standard deviation [0.229, 0.224, 0.225] Normalize the clipping area, and then input the normalized image into the fine-grained recognition model based on the visual self-attention mechanism;

[0046] Step 2: The input image outputs a 14×14×2048-dimensional feature tensor through a shared convolutional neural network. The student-model uses the anchor frame idea of ​​the region proposal network RPN commonly used in the field of target detection to set the step size to 1, 2, 2, Three 3×3 convolutional layers with 128 output channels are sequentially connected to the shared base network to reduce ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a weakly supervised fine-grained image recognition method based on a visual self-attention mechanism, including student-model, teacher-model and classification-model modules; the student-model and teacher-model are learned by Pairwise Approach-based Teacher ‑Student loop feedback mechanism is combined to form a self-attention region recommendation network, thereby enhancing the connection between discriminative region positioning and fine-grained feature learning, and can still detect fine-grained features more accurately in the absence of target bounding boxes and part label points. The discriminative area in the granular image significantly improves the recognition accuracy; at the same time, the three modules of student-model, teacher-model and classification-model share the convolutional layer, effectively compressing the model storage space and reducing the calculation cost, making the method meet real-time It is suitable for large-scale real scenarios; in addition, the dynamic weight distribution mechanism is used in multi-task joint learning to reduce the amount of artificially set hyperparameters and enhance the robustness of the model. Finally, the overall model is trained in an end-to-end single-stage manner and learning to reduce the difficulty of network optimization.

Description

technical field [0001] The invention relates to the technical field of computer vision, in particular to a weakly supervised fine-grained image recognition method based on a visual self-attention mechanism. Background technique [0002] Fine-grained image recognition is a challenging research topic in the field of computer vision, the purpose is to distinguish different subcategories under the same category. Compared with the cross-species coarse-grained image recognition task, the appearance similarity of different types of targets in the fine-grained image dataset is relatively high, while the same type of target has significant visual differences due to factors such as illumination, posture, and viewing angle, so the general depth It is difficult for learning image recognition techniques to accurately discriminate fine-grained object categories. Previous studies have shown that the difficulty of fine-grained image recognition tasks lies in discriminative region localizat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06V10/762G06V10/764G06V10/82G06V10/46G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06V10/464G06N3/045G06F18/23G06F18/24
Inventor 李春国刘杨杨哲杨绿溪徐琴珍
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products