Image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A conditional random field and feature fusion technology, which is applied in image analysis, image enhancement, image data processing, etc., can solve the problems of difficult convergence, difficult convergence of back-end model optimization, and non-convergence, so as to alleviate the slow convergence speed, ease even Difficult to converge effect

Active Publication Date: 2021-07-09

SUZHOU UNIV

View PDF2 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The standard convolutional neural network architecture is not good at dealing with the balance between local information and global information. The pooling layer can make the network achieve a certain degree of spatial invariance and maintain the same computational efficiency, but loses the global context information. The convolutional network without pooling layer is also limited, after all, the receptive field of its neurons can only grow linearly with the number of layers

[0009] 2. The optimization of the back-end model is difficult to converge: the back-end introduces a conditional random field to optimize the output of the segmentation architecture to enhance its ability to capture details

However, whether it is the commonly used Gibbs conditional random field or Markov random field, there are problems such as complex formulas and invariant solutions. These problems often lead to slow convergence, difficult convergence or even non-convergence during model training.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0064] Shown in conjunction with the accompanying drawings is a specific implementation of the image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field of the present invention. The basic frame diagram of the method is as follows figure 1 As shown, this method realizes the full convolution architecture of layer-by-layer fusion of multi-level features by constructing an image pyramid, trying to use a top-down tuning framework to replace the previously popular parallel pooling module, while obtaining features of different scales layer by layer Fusion, which ensures the preferential fusion of features between adjacent layers of the pyramid, maximally captures contextual information. In addition, the Gaussian conditional random field is used to further optimize the front-end output, capture more spatial details, and make the object boundaries in the segmentation effect map more accurate. The output of the final overall architectu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field, including: 1) constructing an image pyramid; 2) using atrous convolution to keep the resolution of the feature map unchanged; 3) multi-level features Layer fusion tuning architecture; 4) upsampling with bilinear interpolation; 5) definition of loss function; 6) optimized output of Gaussian conditional random field. The present invention realizes the full convolution architecture of layer-by-layer fusion of multi-level features by constructing an image pyramid, and uses a top-down optimization framework to replace the previously popular parallel pooling module. The features between the adjacent layers of the pyramid are preferentially fused to capture the context information to the maximum extent, and the Gaussian conditional random field is used to further optimize the front-end output, capture more spatial details, and make the object boundary in the segmentation effect map more accurate, and finally the output of the overall architecture Get the best semantic segmentation effect.

Description

technical field [0001] The invention relates to an image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field. Background technique [0002] Early image segmentation only roughly divided the content of the image into several regions, but with the development of research, the roughness of image segmentation can no longer meet the needs of various applications, and then semantic segmentation was proposed. The semantics of an image refers to semantic information such as the category of objects or entities contained in an image or an image region, and image segmentation under semantics is called semantic segmentation. Image semantic segmentation can separate the foreground from the background in a single frame image, and identify the category of each foreground object, which is equivalent to assigning a semantic label to each pixel. Semantic segmentation is a major upgrade of image segmentation in both accuracy and fineness. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06T7/10

CPCG06T2207/20016G06T7/10

Inventor 周鹏程

Owner SUZHOU UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Image semantic segmentation method based on multi-level feature fusion and Gaussian conditional random field

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology