Unlock instant, AI-driven research and patent intelligence for your innovation.

High-resolution remote sensing image multi-label classification method of semantic multi-head attention mechanism

A remote sensing image, high-resolution technology, applied in the field of remote sensing image processing, can solve the problems of being unable to pay attention to long-distance relationships, unable to build local receptive fields, etc., and achieve high-precision results

Pending Publication Date: 2022-02-25
WUHAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Image feature extraction based on DCNN is an important step in image multi-label classification. However, due to the convolution operation of DCNN, the receptive field of DCNN is affected by the convolution kernel. The features extracted by DCNN mainly focus on short-distance relationships, and cannot focus on long-term distance relationship
However, the semantic learning method based on the multi-head attention mechanism can build a global relationship but cannot build a local receptive field.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-resolution remote sensing image multi-label classification method of semantic multi-head attention mechanism
  • High-resolution remote sensing image multi-label classification method of semantic multi-head attention mechanism
  • High-resolution remote sensing image multi-label classification method of semantic multi-head attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The specific implementation of the method of the present invention will be further described below in conjunction with the accompanying drawings.

[0032] Such as figure 1 As shown, a semantic relationship learning method with a multi-head attention mechanism with a local receptive field, including the following steps:

[0033] 1) Samples are input into the constructed model for training to obtain trained weights, the model comprising a feature extraction module, a semantically sensitive module, and a semantic relationship building module;

[0034] 2) Input the remote sensing image of the test area as the input source into the trained model;

[0035] 3) Utilize feature extraction to perform feature encoding on the image to obtain a feature map F; the feature extractor is a DCNN model (such as: VGG16, ResNet50, DenseNet201);

[0036] 4) Input F into the semantic-sensitive module to obtain a content-aware category expression S; the semantic-sensitive module includes two...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a high-resolution remote sensing image multi-label classification method of a semantic multi-head attention mechanism. According to the method designed by the invention, the semantic region of the image can be positioned, and the relationship between the tags can be inferred autonomously. The network comprises three modules: a feature extraction module, a semantic sensitive module and a semantic relationship construction module. The feature extraction module extracts image features by using a deep convolutional network. The semantic sensitive module is used for locating semantic attention areas in the features and generating category expressions of content awareness. And the semantic relationship construction module is used for reasoning a label relationship in the content-perceived category expression to predict a final result. Experiments show that by means of the method, the semantic region can be effectively positioned, the relation between categories can be established with better robustness, and the method has higher precision in multi-label classification.

Description

technical field [0001] The invention belongs to the technical field of remote sensing image processing, is a semantic relationship learning method with a multi-head attention mechanism with local receptive fields, and can be applied to image multi-label classification. Background technique [0002] Due to the complexity of remote sensing scenarios, multi-label image classification of high-resolution remote sensing images is more general and practical than single-label image classification. High-resolution remote sensing scenes include various categories with correlations and differences among them. For example, for the correlation between categories, such as: "road" and "car" often co-occur in remote sensing images, "grass" and "water" accompany "golf course", and "airport" often contains "aircraft". There are not only semantic associations between categories, but also spatial location associations. Even if they have the same feature type, their spatial relationships are d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06N3/04
CPCG06N3/045
Inventor 肖志峰谈筱薇
Owner WUHAN UNIV