Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semantic segmentation method based on feature pyramid attention and mixed attention cascading

A feature pyramid and semantic segmentation technology, applied in the field of pattern recognition, can solve the problems of low segmentation accuracy, difficult to meet the requirements of segmentation accuracy, poor processing of segmentation target edge details, etc., to achieve the effect of optimizing processing and ensuring the speed of inference

Active Publication Date: 2021-04-13
NANJING UNIV OF SCI & TECH
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing fast semantic segmentation algorithms usually only retain a simple codec structure for image feature extraction and restoration, and lack of full use of multi-scale feature information, resulting in low segmentation accuracy, especially for edge detail processing of segmentation targets Poor, it is difficult to meet the requirements for segmentation accuracy in practical applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic segmentation method based on feature pyramid attention and mixed attention cascading
  • Semantic segmentation method based on feature pyramid attention and mixed attention cascading
  • Semantic segmentation method based on feature pyramid attention and mixed attention cascading

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] A semantic segmentation method based on feature pyramid attention and mixed attention cascade, the specific steps are:

[0056] Step 1. Construct a semantic segmentation training set, specifically:

[0057] Preprocess the images in the Cityscapes urban road dataset, and perform normalization and standardization according to the RGB mean (0.485, 0.456, 0.406) and variance (0.229, 0.224, 0.225) of the dataset. 2975 finely labeled images are used as the training set, and 500 A finely annotated image is used as the validation set.

[0058] Step 2. Construct a deep convolutional neural network. The overall structure is as follows figure 2 Shown:

[0059] The deep convolutional neural network includes an encoder part, a feature pyramid attention module, a mixed attention module, a feature fusion part, and a decoding branch.

[0060] In a further embodiment, the encoder part adopts the structure in the existing MobileNetV2, such as image 3 As shown in a, the present inve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semantic segmentation method based on feature pyramid attention and mixed attention cascading. The method comprises the steps of building a semantic segmentation training set; constructing a deep convolutional neural network, wherein the deep convolutional neural network comprises an encoder part, two feature pyramid attention modules, a mixed attention module, a decoding branch, a feature fusion part and a deep separable convolutional layer; training the deep convolutional neural network by using the semantic segmentation training set, and correcting network parameters; and inputting the streetscape road scene image to be segmented into the trained deep convolutional neural network to obtain a segmentation result. The invention can better adapt to the requirements of unmanned vehicle equipment for precision and speed.

Description

technical field [0001] The invention belongs to pattern recognition technology, specifically a semantic segmentation method based on feature pyramid attention and mixed attention cascade. Background technique [0002] Image semantic segmentation (semantic segmentation), also known as scene parsing (scene parsing), is a basic and challenging research direction in computer vision at present. Its task is to assign semantic labels to each pixel in the image, and a scene image Segment and parse into distinct image regions that correspond to semantic categories, including continuous objects (e.g. sky, road, grass) and discrete objects (e.g. people, cars, bicycles), etc. [0003] Image semantic segmentation technology enables computers to understand complex images containing multi-category objects. Research in this area has a wide range of application values ​​in areas such as unmanned vehicles, robot perception, and medical images. In recent years, due to the emergence of GPU com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T7/10G06N3/08G06N3/04
CPCG06T7/10G06N3/08G06T2207/20081G06N3/045
Inventor 徐锦浩王琼陈涛陆建峰
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products