Compressive attention model for semantic segmentation of pixel group

An attention model and semantic segmentation technology, applied in biological neural network models, character and pattern recognition, instruments, etc., to achieve the effect of enhancing pixel-level dense prediction and good semantic segmentation effect

Pending Publication Date: 2021-07-09
LIAONING TECHNICAL UNIVERSITY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to overcome the deficiencies in the prior art, the technical problem solved by the present invention is to provide a kind of compressed attention model for semantically segmented pixel groups, and to solve the problem of considering neglected pixel groups attention

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Compressive attention model for semantic segmentation of pixel group
  • Compressive attention model for semantic segmentation of pixel group
  • Compressive attention model for semantic segmentation of pixel group

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The specific implementation of the present invention will be described in detail below in conjunction with the accompanying drawings. As a part of this specification, the principles of the present invention will be described through examples. Other aspects, features and advantages of the present invention will become clear through the detailed description. In the referenced drawings, the same reference numerals are used for the same or similar components in different drawings.

[0023] Such as figure 1 As shown, the compressed attention model for semantically segmented pixel groups of the present invention includes:

[0024] To learn more representative features for the semantic segmentation task through a reweighting mechanism that considers both local and global aspects.

[0025] The SA model first uses the residual network (ResNets) as the basic residual block. The traditional residual representation is as (1):

[0026]

[0027] F( ) represents the residual func...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a compressed attention model for semantic segmentation of pixel groups. The model utilizes an effective compressed attention (SA) module to solve two unique features of the pixel groups in semantic segmentation: (1) pixel group attention and (2) pixel-by-pixel prediction. Specifically, according to the provided SA model, attention of a pixel group is added to conventional convolution by introducing an attention convolution channel, so that interdependence of space channels is considered in an effective mode. Unlike existing attention models, a compressed global attention model (SA) is generated while aggregating multi-scale features using downsampling channels implemented through pooling layers. Therefore, the SA model enhances the target of pixel-level dense prediction and takes into account the problem of ignored pixel group attention. Compared with other methods, the invention has the advantages that the method is obviously improved, and the test result on a PASCAL data set shows that the PACc and mIoU of the new method are higher than those of classical methods such as FCN50 and FCN101.

Description

technical field [0001] The invention belongs to the technical field of semantic segmentation, and in particular relates to a compressed attention model for semantic segmentation pixel groups. Background technique [0002] Convolutional Network (FCN) recovers the category to which each pixel belongs from abstract features. That is, the classification from the image level is further extended to the classification at the pixel level. The deconvolution layer is used to upsample the feature map of the last convolutional layer to restore it to the same size as the input image, thus producing a prediction for each pixel while preserving the spatial information in the original input image , and finally perform pixel-wise classification on the upsampled feature map. However, most of them focus on improving segmentation performance from the pixel level, but largely ignore the implicit task of pixel grouping. [0003] Multi-Scale Spatial Asymmetric Recalibration (MS-SAR), which demo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34G06K9/62G06N3/04
CPCG06V10/267G06N3/045G06F18/217
Inventor 叶松发齐向明王晓龙刘强严萍萍李健林
Owner LIAONING TECHNICAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products