Semantic segmentation method based on feature pyramid attention and mixed attention cascading

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A feature pyramid and semantic segmentation technology, applied in the field of pattern recognition, can solve the problems of low segmentation accuracy, difficult to meet the requirements of segmentation accuracy, poor processing of segmentation target edge details, etc., to achieve the effect of optimizing processing and ensuring the speed of inference

Active Publication Date: 2021-04-13

NANJING UNIV OF SCI & TECH

View PDF3 Cites 14 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Existing fast semantic segmentation algorithms usually only retain a simple codec structure for image feature extraction and restoration, and lack of full use of multi-scale feature information, resulting in low segmentation accuracy, especially for edge detail processing of segmentation targets Poor, it is difficult to meet the requirements for segmentation accuracy in practical applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0055] A semantic segmentation method based on feature pyramid attention and mixed attention cascade, the specific steps are:

[0056] Step 1. Construct a semantic segmentation training set, specifically:

[0057] Preprocess the images in the Cityscapes urban road dataset, and perform normalization and standardization according to the RGB mean (0.485, 0.456, 0.406) and variance (0.229, 0.224, 0.225) of the dataset. 2975 finely labeled images are used as the training set, and 500 A finely annotated image is used as the validation set.

[0058] Step 2. Construct a deep convolutional neural network. The overall structure is as follows figure 2 Shown:

[0059] The deep convolutional neural network includes an encoder part, a feature pyramid attention module, a mixed attention module, a feature fusion part, and a decoding branch.

[0060] In a further embodiment, the encoder part adopts the structure in the existing MobileNetV2, such as image 3 As shown in a, the present inve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a semantic segmentation method based on feature pyramid attention and mixed attention cascading. The method comprises the steps of building a semantic segmentation training set; constructing a deep convolutional neural network, wherein the deep convolutional neural network comprises an encoder part, two feature pyramid attention modules, a mixed attention module, a decoding branch, a feature fusion part and a deep separable convolutional layer; training the deep convolutional neural network by using the semantic segmentation training set, and correcting network parameters; and inputting the streetscape road scene image to be segmented into the trained deep convolutional neural network to obtain a segmentation result. The invention can better adapt to the requirements of unmanned vehicle equipment for precision and speed.

Description

technical field [0001] The invention belongs to pattern recognition technology, specifically a semantic segmentation method based on feature pyramid attention and mixed attention cascade. Background technique [0002] Image semantic segmentation (semantic segmentation), also known as scene parsing (scene parsing), is a basic and challenging research direction in computer vision at present. Its task is to assign semantic labels to each pixel in the image, and a scene image Segment and parse into distinct image regions that correspond to semantic categories, including continuous objects (e.g. sky, road, grass) and discrete objects (e.g. people, cars, bicycles), etc. [0003] Image semantic segmentation technology enables computers to understand complex images containing multi-category objects. Research in this area has a wide range of application values in areas such as unmanned vehicles, robot perception, and medical images. In recent years, due to the emergence of GPU com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06T7/10G06N3/08G06N3/04

CPCG06T7/10G06N3/08G06T2207/20081G06N3/045

Inventor 徐锦浩王琼陈涛陆建峰

Owner NANJING UNIV OF SCI & TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Semantic segmentation method based on feature pyramid attention and mixed attention cascading

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology