Image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field

A conditional random field and feature fusion technology, which is applied in image analysis, image enhancement, image data processing, etc., can solve the problems of difficult convergence, difficult convergence of back-end model optimization, and non-convergence, so as to alleviate the slow convergence speed, ease even Difficult to converge effect

Active Publication Date: 2021-07-09
SUZHOU UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The standard convolutional neural network architecture is not good at dealing with the balance between local information and global information. The pooling layer can make the network achieve a certain degree of spatial invariance and maintain the same computational efficiency, but loses the global context information. The convolutional network without pooling layer is also limited, after all, the receptive field of its neurons can only grow linearly with the number of layers
[0009] 2. The optimization of the back-end model is difficult to converge: the back-end introduces a conditional random field to optimize the output of the segmentation architecture to enhance its ability to capture details
However, whether it is the commonly used Gibbs conditional random field or Markov random field, there are problems such as complex formulas and invariant solutions. These problems often lead to slow convergence, difficult convergence or even non-convergence during model training.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field
  • Image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field
  • Image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0064] Shown in conjunction with the accompanying drawings is a specific implementation of the image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field of the present invention. The basic frame diagram of the method is as follows figure 1 As shown, this method realizes the full convolution architecture of layer-by-layer fusion of multi-level features by constructing an image pyramid, trying to use a top-down tuning framework to replace the previously popular parallel pooling module, while obtaining features of different scales layer by layer Fusion, which ensures the preferential fusion of features between adjacent layers of the pyramid, maximally captures contextual information. In addition, the Gaussian conditional random field is used to further optimize the front-end output, capture more spatial details, and make the object boundaries in the segmentation effect map more accurate. The output of the final overall architectu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field, including: 1) constructing an image pyramid; 2) using atrous convolution to keep the resolution of the feature map unchanged; 3) multi-level features Layer fusion tuning architecture; 4) upsampling with bilinear interpolation; 5) definition of loss function; 6) optimized output of Gaussian conditional random field. The present invention realizes the full convolution architecture of layer-by-layer fusion of multi-level features by constructing an image pyramid, and uses a top-down optimization framework to replace the previously popular parallel pooling module. The features between the adjacent layers of the pyramid are preferentially fused to capture the context information to the maximum extent, and the Gaussian conditional random field is used to further optimize the front-end output, capture more spatial details, and make the object boundary in the segmentation effect map more accurate, and finally the output of the overall architecture Get the best semantic segmentation effect.

Description

technical field [0001] The invention relates to an image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field. Background technique [0002] Early image segmentation only roughly divided the content of the image into several regions, but with the development of research, the roughness of image segmentation can no longer meet the needs of various applications, and then semantic segmentation was proposed. The semantics of an image refers to semantic information such as the category of objects or entities contained in an image or an image region, and image segmentation under semantics is called semantic segmentation. Image semantic segmentation can separate the foreground from the background in a single frame image, and identify the category of each foreground object, which is equivalent to assigning a semantic label to each pixel. Semantic segmentation is a major upgrade of image segmentation in both accuracy and fineness. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06T7/10
CPCG06T2207/20016G06T7/10
Inventor 周鹏程
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products