Semantic segmentation method and system based on pixel rearrangement reconstruction for streetscape understanding

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A semantic segmentation and pixel rearrangement technology, applied in the field of computer vision, can solve the problems of insufficient ability to repair details, inability to represent, and inability to reconstruct feature features, etc.

Active Publication Date: 2019-07-26

FUZHOU UNIV

View PDF3 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

At the same time, in the encoding process, in order to better capture the characteristics of objects of different sizes, different receptive fields and scale information are often combined, such as spatial pyramid pooling technology with holes, which cannot effectively represent the features of the point itself, and calculate different scales. Smaller scale features are not reused when feature features

At the same time, in the existing semantic segmentation methods, transposed convolution or bilinear interpolation methods are generally used to expand the features step by step in the decoding process, so the feature size is increased step by step, and the reconstruction feature cannot be effectively reconstructed. feature reuse

And in this process, shallow features are often added to optimize the decoding process, but there is no clear optimization target for shallow features, so the ability to repair details in the reconstruction process is slightly insufficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0052] The technical solution of the present invention will be specifically described below in conjunction with the accompanying drawings.

[0053] The present invention provides a semantic segmentation method based on pixel rearrangement and reconstruction for street view understanding, such as figure 1 shown, including the following steps:

[0054] Step A: Preprocess the input image of the training set. First, the image is normalized by subtracting its image mean value, and then the image is randomly cut to a uniform size to obtain a preprocessed image of the same size.

[0055] Step B: Extract a general feature F with a general convolutional network backbone , and then based on the general feature F backbone Obtain dense spatial pyramid fusion feature F with holes daspp , used to capture multi-scale context information, and then use the two-part concatenation described in step B as an encoding network to extract encoding features F encoder ; Concretely include the follo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a semantic segmentation method and system based on pixel rearrangement reconstruction for streetscape understanding, and the method comprises the steps: carrying out the preprocessing of an input image of a training set, enabling the image to be standardized, and obtaining preprocessed images with the same size; extracting general features by using a convolutional network,then obtaining dense perforated space pyramid fusion features, and extracting coding features by using the two parts of cascade connection as a coding network; selecting an intermediate layer featurefrom the convolutional network, calculating an edge feature by combining the encoding feature, taking a dense network based on a pixel rearrangement technology as a decoding network, reconstructing the image resolution, and calculating the decoding feature; performing calculating to obtain semantic segmentation loss and auxiliary supervision edge loss, and training the deep neural network by taking minimization of weighted sum loss of the semantic segmentation loss and the auxiliary supervision edge loss as a target; and performing semantic segmentation on the to-be-segmented image by using the deep neural network model, and outputting a segmentation result. The method and the system are beneficial to improving the accuracy and robustness of image semantic segmentation.

Description

technical field [0001] The present invention relates to the technical field of computer vision, in particular to a semantic segmentation method and system for street scene understanding based on pixel rearrangement and reconstruction. Background technique [0002] Image semantic segmentation is an important branch of computer vision in the field of artificial intelligence and an important part of image understanding in machine vision. Image semantic segmentation is to accurately classify each pixel in the image to its category, making it consistent with the visual representation of the image itself, so the image semantic segmentation task is also called pixel-level image classification task. [0003] Due to the similarity between image semantic segmentation and image classification, various image classification networks are often used as the backbone of the image semantic segmentation network after removing the last fully connected layer, and are interchangeable with each ot...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/62G06N3/04

CPCG06N3/045G06F18/2155G06F18/214G06F18/24

Inventor 陈羽中林洋洋柯逍黄腾达

Owner FUZHOU UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Semantic segmentation method and system based on pixel rearrangement reconstruction for streetscape understanding

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology