Check patentability & draft patents in minutes with Patsnap Eureka AI!

Semantic segmentation method introducing feature cross attention mechanism

A technology of attention and mechanism, applied in computer components, instruments, biological neural network models, etc., can solve the problems of less edge modification of segmentation results, rough image segmentation boundaries, and inability to fully utilize long-distance pixel category relationships, etc., to achieve Accurate division and avoid unreasonable labeling effect

Inactive Publication Date: 2021-02-05
XIANGTAN UNIV
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the image semantic segmentation method based on the PascalVoc2012 data set, the currently popular models include FCN, U-Net, and Deeplab series. They all have less edge modification on the segmentation results, resulting in rough boundaries for some image segmentation and long-distance pixel categories. The relationship between the relationship can not be fully utilized and other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic segmentation method introducing feature cross attention mechanism
  • Semantic segmentation method introducing feature cross attention mechanism
  • Semantic segmentation method introducing feature cross attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the purpose and technical solution of the present invention clearer, the application principles of the present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0052] The embodiment of the present invention provides a Deeplabv3+ model graph that introduces a feature cross-attention mechanism. Figure 5 It shows a schematic diagram of the improved Deeplabv3+ model, and the specific operation process refers to Figure 5 As shown, the method includes the following steps.

[0053] The Deeplabv3+ model is rewritten based on the Xception network. First, the last fully connected layer is removed to achieve end-to-end output.

[0054] The last two pooling layers of the Xception network are removed, because the convolution itself has translation invariance, and the pooling layer can further enhance this characteristic of the network, because the pooling layer itself is a process of fuzzy position. Sem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The method aims at solving the problems that a Deeplabv3+ model is inaccurate in picture target edge segmentation, image feature fitting is slow, and attention information cannot be effectively utilized. A feature cross attention module is added to the model, and the cross attention network is composed of two branches and a feature cross attention module. The shallow branches are used for extracting low-level space information, and the deep branches are used for extracting high-level context features, so that important features are extracted more finely. According to the invention, the connection between the feature cross attention mechanism and the Deeplabv3+ coding module is designed and realized, and the output features of the Deeplabv3+ coding module are input into the feature cross attention module for convolution operation to realize re-calibration of the original features. A Deeplabv3+ decoding module acquires spatial features and channel features from two branches respectively,and then fuses the acquired features to acquire more important features. The improved model is verified through a Pascal Voc2012 data set, and results show that the model added with the feature crossattention mechanism can effectively improve the defects of an original model, can segment a target more finely, and better solves the problems of rough segmentation boundary and the like.

Description

technical field [0001] The invention belongs to the field of semantic segmentation, relates to a semantic segmentation model that introduces an attention mechanism, and specifically relates to introducing a double attention mechanism method into a Deeplabv3+ model. Background technique [0002] At present, the convolutional neural network has greatly promoted the execution of visual tasks with its rich representation capabilities, and image semantic segmentation is one of the key tasks to promote computer vision. As a classic computer vision problem (image classification, object recognition detection, semantic segmentation), image semantic segmentation has always been an important research direction. Its essence is to classify pixels in pictures. Image semantic segmentation is widely used in autonomous driving, object classification, cloud detection, medical detection and other related fields. In the image semantic segmentation method based on the PascalVoc2012 data set, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34G06K9/62G06N3/04
CPCG06V10/267G06N3/045G06F18/214G06F18/253
Inventor 彭思齐曾海波
Owner XIANGTAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More