Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Parallel multi-scale attention mechanism semantic segmentation method and device based on deep learning

A deep learning and semantic segmentation technology, applied in the field of deep learning and computer vision, can solve the problem of not taking into account the differences in different receptive fields, and achieve the effect of increasing the receptive field, improving the segmentation accuracy, and improving the accuracy.

Active Publication Date: 2021-01-15
XIANGTAN UNIV
View PDF10 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although ASPP can effectively capture multi-scale information by using several convolution kernels with different hole rates, it does not take into account the differences between the features captured by different receptive fields in the multi-scale information aggregation stage.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel multi-scale attention mechanism semantic segmentation method and device based on deep learning
  • Parallel multi-scale attention mechanism semantic segmentation method and device based on deep learning
  • Parallel multi-scale attention mechanism semantic segmentation method and device based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The present embodiment will be further described below in conjunction with the accompanying drawings.

[0068] The image processing process involved in this embodiment is as follows: figure 1 shown in figure 1 The neural network model structure includes image preprocessing, downsampling feature extraction module, parallel multi-scale attention module, and upsampling model. The parallel multi-scale attention module includes the ASPP module and the EDS module.

[0069] The preprocessing stage can be understood as a kind of data enhancement, that is, rotating, scaling, cropping and flipping the image. By preprocessing the image, the semantic segmentation effect can be improved and the robustness of the model can be enhanced. Specifically, in this embodiment, the image is first randomly reduced or enlarged by a factor of 0.5 to 1.5, and corresponding padding is performed after reduction, or corresponding cropping is performed after enlargement, so that the image returns ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a parallel multi-scale attention mechanism semantic segmentation method based on deep learning, and the method comprises the steps: firstly carrying out the preprocessing of animage data set, and improving the segmentation precision and robustness of a model; based on ResNet50 obtained after adjustment of a fifth convolution layer, aggregating multi-scale semantic information through a parallel multi-scale attention module connected to the top of a base network. And finally, recovering the image size through bilinear up-sampling. According to the method, a similarity EDS module is added behind a feature map obtained by five original parallel convolution kernels with different sizes. By adding the attention mechanism, important semantic information in a feature mapobtained by five parallel expansion convolution is enhanced, and secondary semantic information is inhibited.

Description

technical field [0001] The invention belongs to the field of deep learning and computer vision, and in particular relates to a deep learning-based parallel multi-scale attention mechanism semantic segmentation method and device. Background technique [0002] Semantic segmentation is a basic and challenging task. Its purpose is to predict the category of each pixel, that is, to learn high-level semantic information and local location information of object outline, object position and object category. As one of the most basic tasks of computer vision, semantic segmentation has been widely used in areas such as autonomous driving, medical diagnosis, video editing, object detection, and aerial image analysis. In recent years, with the development of deep convolutional neural networks, compared with traditional machine learning methods such as random forests, deep convolutional neural networks have more powerful feature extraction capabilities. In particular, the emergence of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/11G06T5/00G06N3/04G06N3/08
CPCG06T7/11G06N3/084G06T2207/10004G06T2207/20024G06T2207/20081G06T2207/20084G06N3/048G06N3/045G06T5/70Y02T10/40
Inventor 周彦周振王冬丽
Owner XIANGTAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products