Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-scale fine-grained image recognition method and system based on multi-granularity attention

A technology of image recognition and attention, applied in the field of image processing, to achieve the effect of improving performance and classification accuracy

Active Publication Date: 2022-02-18
OCEAN UNIV OF CHINA
View PDF6 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of the deficiencies in the above-mentioned existing technologies, the present invention provides a multi-scale fine-grained image recognition method and system based on multi-granularity attention. (1) First, in order to solve the problem of how to locate key areas in different image granularity by the attention mechanism, An attention-based multi-granularity structure is proposed, which divides the feature extraction network into several stages. Through the multi-granularity attention module, each stage focuses on capturing the most discriminative features with specific granularity in the corresponding layer of the feature extraction network. area; (2) Then it is proposed to use the parallel multi-scale convolution module to extract feature maps of different scales and different granularities in different stages. The module is divided into several levels, each level contains convolution kernels of different sizes and depths, parallel multi-scale The convolution module can use convolution kernels of different sizes to process input feature maps in parallel to capture the details of feature maps of different scales and granularity; (3) Finally, for the problem of fusing features from different regions in a synergistic manner, this paper Invented and designed a feature fusion module, the feature maps of different stages are compressed into feature vectors through the maximum pooling layer and fused, which not only fully excavates the relationship between different regions, but also effectively integrates low-dimensional spatial information and high-dimensional semantic information , thus improving the classification accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-scale fine-grained image recognition method and system based on multi-granularity attention
  • Multi-scale fine-grained image recognition method and system based on multi-granularity attention
  • Multi-scale fine-grained image recognition method and system based on multi-granularity attention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0053] combine Figure 1-Figure 5 As shown, this embodiment provides a fine-grained image recognition method based on a multi-scale neural network with multi-granularity attention, including the following steps:

[0054] Step 1. Multi-granularity feature extraction:

[0055] In this embodiment, ScaleNet is used as the feature extraction network, which includes N stages in total.

[0056] In this embodiment, the feature extraction network is divided into S stages through an attention-based multi-granularity structure, images of different granularity are input to different stages of the feature extraction network, and feature extraction is performed on images of different granularities in different stages to obtain different Raw feature maps for granularity.

[0057] Assuming the multi-grain structure based on attention k The input image for the stage is ,in

[0058] . here the first k The original feature map of the output of the stage is F k for:

[0059]

[00...

Embodiment 2

[0104] This embodiment provides a multi-scale fine-grained image recognition system based on multi-granularity attention, combining Figure 5 As shown in the network architecture diagram of the model used in the present invention, the fine-grained image recognition system includes an attention-based multi-granularity structure, a multi-granularity attention module, a parallel multi-scale convolution module, a feature fusion module, and a classifier. The multi-granularity structure based on attention divides the feature extraction network into several stages, and extracts the original feature maps of different granularity images in different stages; the multi-granularity attention module divides the original features of different granularities in each stage The image and its feature map with increased receptive field obtained through the convolution block are fused, and then for the fused feature map, attention weights are generated from the two domains of channel and space, and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of image processing, and discloses a multi-scale fine-grained image recognition method and system based on multi-granularity attention. The method comprises the following steps: constructing an attention-based multi-granularity structure, dividing a feature extraction network into a plurality of stages, inputting images with different granularities into different stages of the feature extraction network, and carrying out feature extraction on the image to obtain an original feature map; obtaining attention weights from a channel domain and a space domain for the feature map of each stage through a multi-granularity attention module, fusing the attention weights, and then carrying out weighted fusion on the attention weights and the feature map to obtain key regions of different granularities in different stages; constructing a parallel multi-scale convolution module, grouping the feature maps, independently applying different types of convolution kernels to each group of feature maps, and performing feature extraction on the feature maps with different scales and granularities in different stages; and finally, carrying out feature fusion on the obtained feature map. The relationship between different regions can be fully mined, and low-dimensional space information and high-dimensional semantic information are fused.

Description

technical field [0001] The invention belongs to the technical field of image processing, and relates to deep learning and fine-grained image recognition technology, in particular to a multi-scale fine-grained image recognition method and system based on multi-granularity attention. Background technique [0002] Fine-grained image recognition aims to classify more detailed subclasses of coarse-grained categories. However, fine-grained image recognition remains a challenging task due to the high intra-class variance and low between-class variance of fine-grained images. [0003] Early fine-grained image recognition methods addressed this issue with part-based feature representation via human-annotated bounding boxes / part annotations. However, specialized knowledge and a lot of annotation time are required in the labeling process. Therefore, for practical fine-grained image recognition tasks, strongly supervised methods that require a lot of time and resources for annotation ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/25G06V10/80G06V10/82G06K9/62G06N3/04
CPCG06N3/045G06F18/253
Inventor 黄磊安辰魏志强张科
Owner OCEAN UNIV OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products