Semantic segmentation method based on multi-scale deformable convolution

A semantic segmentation, multi-scale technology, applied in instruments, biological neural network models, character and pattern recognition, etc., can solve the problem of not being able to extract spatial information well

Pending Publication Date: 2020-02-21
TIANJIN UNIV
View PDF5 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

So if you just use multiple parallel expansion convolutions in the final output...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic segmentation method based on multi-scale deformable convolution
  • Semantic segmentation method based on multi-scale deformable convolution
  • Semantic segmentation method based on multi-scale deformable convolution

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to make the technical solution of the present invention clearer, the present invention will be further elaborated below in conjunction with the accompanying drawings. The present invention is concretely realized according to the following steps:

[0023] The first step is to prepare the dataset.

[0024] Collect images of different categories, and after unifying the categories, generate image label information for the selected category. Each label image is single-channel, and the value of each pixel corresponds to the selected category. The collected images are divided into training set, verification set and test set. The training set is used to train the convolutional neural network, the verification set is used to select the best training model, and the test set is used for subsequent testing of the model effect or practical application. For ease of use, this patent uses the public PASCALVOC2012 data set for related experiments.

[0025] The second step is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a semantic segmentation method based on multi-scale deformable convolution. The semantic segmentation method comprises the following steps: step 1, preparing semantic segmentation input image data for training and corresponding labels thereof; step 2, training the deep learning network; firstly, a basic model of a network is pre-trained; parameters of the network are further optimized on the basis of pre-training; a relevant semantic segmentation module is added to carry out further training to finally obtain network parameters suitable for the data set, and for a newly input image, the network carries out forward calculation to finally obtain the output of the image, so that the image can classify each pixel in the image to form a semantically segmented output image; and step 3, carrying out weight updating on the loss function by adopting a gradient descent method, so that the loss is gradually reduced, and carrying out iterative training until the network converges or reaches the maximum number of iterations to obtain final network parameters, and storing the trained network model and each parameter weight to form a semantic segmentation model.

Description

technical field [0001] The invention belongs to the field of semantic segmentation and relates to a method for semantically segmenting images by using multi-scale deformable convolution. Background technique [0002] Semantic segmentation is the task of classifying pixels into the categories identified by the dataset. It is a fundamental and challenging field in image processing. This technology is widely used in different fields such as self-driving cars, disease detection in medical images, and drone flight experiments. [0003] In recent years, as the most basic and instructive network at present, based on the fully convolutional neural network [1] (FCN) has achieved a major breakthrough in semantic segmentation methods. Different from the classification network, FCN predicts the pixel category by replacing the fully connected layer of the classification network with a convolutional layer. However, this method predicts each pixel based on a small range of images, whic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N3/04
CPCG06N3/045G06F18/214G06F18/24
Inventor 马帅庞彦伟
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products