Fine-grained image classification method using multi-layer feature fusion with parallel convolution blocks
A feature-fusion and classification technology, applied in neural learning methods, instruments, biological neural network models, etc. It addresses problems such as high computational requirements, small numbers of training samples, and complex network models, with the effect of enhancing inter-layer interaction and feature expression capabilities.
Examples
Embodiment 1
[0041] A fine-grained image classification method based on multi-layer feature fusion with parallel convolutional blocks, see Figure 1; the method includes the following steps:
[0042] 101: Obtain internationally public fine-grained datasets;
[0043] 102: Because the training data set contains a small number of samples, it is augmented by operations such as random cropping and horizontal flipping, and the weight transfer learning method is used to prevent overfitting;
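The augmentation operations in step 102 can be sketched as follows. This is a minimal illustration on raw arrays, assuming images are H×W×C NumPy arrays and a 224×224 crop size (the patent excerpt does not specify the crop dimensions); the function names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_crop(img, crop_h, crop_w):
    # cut a randomly positioned (crop_h, crop_w) window from an H x W x C image
    h, w = img.shape[:2]
    top = rng.integers(0, h - crop_h + 1)
    left = rng.integers(0, w - crop_w + 1)
    return img[top:top + crop_h, left:left + crop_w]

def random_hflip(img, p=0.5):
    # mirror the image left-to-right with probability p
    return img[:, ::-1] if rng.random() < p else img

def augment(img, crop_h=224, crop_w=224):
    # compose the two operations mentioned in step 102
    return random_hflip(random_crop(img, crop_h, crop_w))
```

In practice these transforms are applied afresh at every training epoch, so each pass over the small data set sees slightly different samples.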
[0044] 103: Replace the original convolution block at the last level of the ResNet34 model with the parallel convolution block proposed by the present invention, as shown in Figure 3, to enhance the network's ability to extract salient features;
[0045] Wherein, the ResNet34 model is well known to those skilled in the art, and will not be described in detail in this embodiment of the present invention.
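The idea of a parallel convolution block in step 103 can be sketched numerically: several convolution branches process the same input side by side and their outputs are fused. The branch kernels (a 3×3 and a 1×1 here), the element-wise-sum fusion, and the ReLU are illustrative assumptions; the patent's actual block structure is given in Figure 3, not in this excerpt.

```python
import numpy as np

def conv2d(x, kernel):
    # naive single-channel 2D convolution with "same" zero padding
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((ph, ph), (pw, pw)))
    out = np.zeros_like(x, dtype=float)
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            out[i, j] = np.sum(xp[i:i + kh, j:j + kw] * kernel)
    return out

def parallel_block(x, k3, k1):
    # two parallel branches (3x3 and 1x1 kernels) over the same input,
    # fused by element-wise sum, followed by ReLU (assumed fusion scheme)
    return np.maximum(conv2d(x, k3) + conv2d(x, k1), 0.0)
```

Because both branches see the identical input, the block can capture salient structure at two receptive-field sizes simultaneously, which is the motivation stated in step 103.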
[0046] 104: Input the training samples, the unif...
Embodiment 2
[0059] The scheme is further introduced below in combination with specific examples, calculation formulas, and Figures 2-3; see the description below:
[0060] As shown in Figure 2, the high-level salient features of the present invention are extracted by parallel convolution blocks, and inter-layer information is fused using bilinear pooling. Suppose X, Y, and Z are the high-level feature matrices extracted by the parallel convolution blocks. The model proposed in this paper can be expressed by the following formula:
[0061] o_bp = σ(N(β(X, Y)) + N(β(X, Z)) + N(β(Y, Z)))  (1)
[0062] Among them, o_bp represents the final output of pairwise bilinear interaction between layers, i.e., the output shown in Figure 2; σ represents the softmax function; N represents the normalization operation; β represents the bilinear operation; R represents the dimensional space; h_x represents the length of each feature map of the first layer of the...
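Formula (1) can be sketched directly. Here β(X, Y) is taken as the standard bilinear pooling XᵀY flattened to a vector, and N as signed-square-root plus L2 normalization, a common choice for bilinear models; the excerpt does not specify the exact normalization, so both are assumptions.

```python
import numpy as np

def bilinear(X, Y):
    # β: bilinear interaction of two (d, c) feature matrices -> (c*c,) vector
    return (X.T @ Y).ravel()

def normalize(v, eps=1e-12):
    # N: signed square root followed by L2 normalization (assumed form)
    v = np.sign(v) * np.sqrt(np.abs(v))
    return v / (np.linalg.norm(v) + eps)

def softmax(v):
    # σ: numerically stable softmax
    e = np.exp(v - v.max())
    return e / e.sum()

def fuse(X, Y, Z):
    # formula (1): o_bp = σ(N(β(X,Y)) + N(β(X,Z)) + N(β(Y,Z)))
    s = (normalize(bilinear(X, Y))
         + normalize(bilinear(X, Z))
         + normalize(bilinear(Y, Z)))
    return softmax(s)
```

Summing the three normalized pairwise interactions before the softmax lets every pair of layers contribute to the final class scores, which is the inter-layer interaction effect the abstract claims.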
Embodiment 3
[0080] The scheme in Embodiments 1 and 2 is further introduced below in combination with Figures 5-7 and Tables 1-2; see the following description for details:
[0081] Using the method of the present invention on the three international standard fine-grained image libraries, the heat maps on different channels, compared with those of the unimproved convolution block, are shown in Figure 5: (a), (b), (c), (d) are results on the CUB-200-2011 dataset; (e), (f) are results on the FGVC-Aircraft dataset; (g), (h) are results on the Stanford-Cars dataset. The upper row uses the unimproved convolution block and the lower row uses the improved convolution block. It can be seen that the parallel convolution blocks fully extract the salient regions of the image to be classified.
[0082] Using the method of the present invention, on the three international standard fine-grained image libraries, the training and testing accuracy, and the corresponding confusion ...