Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

DeepLabv3plus-IRCNet image semantic segmentation algorithm based on coding and decoding structure

A semantic segmentation, encoding and decoding technology, applied in the field of DeepLabv3plus-IRCNet image semantic segmentation algorithm, can solve the problems of resolution reduction, small targets, missing pixels, etc., achieve the effect of increasing the receptive field, improving segmentation accuracy, and alleviating information loss

Pending Publication Date: 2020-07-10
BEIFANG UNIV OF NATITIES
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there were two problems with FCN at that time: 1) The resolution of the feature map was constantly shrinking, resulting in the loss of some pixels of small objects; 2) The image context information (the relationship between pixels and pixels) was not fully considered, and the rich information could not be fully utilized. Spatial location information
[0003] These methods are all for the purpose of enabling the model to better extract feature information at different resolutions of the middle layer. They are all directly operated on the entire feature map, but in a certain local area of ​​​​an image there are often For extremely small targets, ordinary convolution operations cannot extract the features of small target objects well, and it is difficult to obtain better semantic segmentation results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • DeepLabv3plus-IRCNet image semantic segmentation algorithm based on coding and decoding structure
  • DeepLabv3plus-IRCNet image semantic segmentation algorithm based on coding and decoding structure
  • DeepLabv3plus-IRCNet image semantic segmentation algorithm based on coding and decoding structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0062] 1, data set selection: the present invention uses CamVid data set, and it is a segmentation data set that is used to understand urban road scene, has included 367 training pictures, 100 verification pictures and 233 test pictures. The resolution of each image is 360x480 pixels, and all images contain 11 semantic categories.

[0063] 2. Evaluation criteria: In order to evaluate the accuracy of image semantic segmentation results, this paper uses the mIoU index as the evaluation standard, and its formula is:

[0064]

[0065] 3. Implementation process: Based on the Keras deep learning framework, NVIDIA GeForce MX150 GPU is used for calculation, and the cuDnn7.0 library is used for acceleration. In the process of training the network, a data augmentation strategy is adopted. Before entering the model training, first adjust the size of the training data set and the verification data set to 320x320, and adopt the data enhancement strategy to set the minimum batch size (m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a DeepLabv3plus-IRCNet image semantic segmentation algorithm based on a coding and decoding structure, and relates to the technical field of image semantic segmentation. The method is based on DeepLabv3plus, and comprises the following steps: extracting picture features by a deep convolutional neural network formed by introducing depth separable convolution inverted residual modules in series, and introducing a feature map segmentation module under 1 / 16 resolution; amplifying each segmentation feature map to the size before segmentation; extracting features in a parameter sharing mode; splicing the corresponding positions of each output feature map and then performing fusion with the feature map amplified to the same size in the decoding stage, so that the extraction capability of the model for features of a small target object is improved, and the problem of small target loss caused by gradual reduction of the resolution of the feature map in the down-samplingprocess of the image is effectively solved.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to a DeepLabv3plus-IRCNet image semantic segmentation algorithm based on a codec structure. Background technique [0002] Image semantic segmentation is one of the fundamental research topics in the field of computer vision, and its goal is to assign a semantic label to each pixel in an image. The current development of the semantic segmentation model based on deep convolutional neural networks (DNCCs) is mainly due to the proposal of the full convolutional neural network (FCN), which replaces all the fully connected layers behind the convolutional neural network with convolutional layers. , upsampled using a bilinear interpolation algorithm, and outputs a split map with the input size. However, there were two problems with FCN at that time: 1) The resolution of the feature map was constantly shrinking, resulting in the loss of some pixels of small objects; 2) The image context infor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/34G06K9/46G06N3/04G06T7/60G06N3/08
CPCG06T7/60G06T2207/20084G06V10/267G06V10/40G06V10/44G06N3/045
Inventor 王海荣刘文
Owner BEIFANG UNIV OF NATITIES
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products