Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semantic segmentation method based on residual pyramid pooling neural network

A pyramid pooling and semantic segmentation technology, applied in the field of indoor scene semantic segmentation, can solve the problems of limited ability to reconstruct accurate details and insufficient resolution, and achieve the effects of compensating losses, ensuring prediction accuracy, and reducing image size

Active Publication Date: 2020-01-24
ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY
View PDF7 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the encoder's most important feature map may be highly semantic, it has limited ability to reconstruct precise details in the segmentation map due to insufficient resolution, which is very common in modern backbone models.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic segmentation method based on residual pyramid pooling neural network
  • Semantic segmentation method based on residual pyramid pooling neural network
  • Semantic segmentation method based on residual pyramid pooling neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments.

[0031] A semantic segmentation method based on residual pyramid pooling proposed by the present invention, its overall implementation block diagram is as follows figure 1 As shown, it includes two processes of training phase and testing phase;

[0032] The specific steps of the described training phase process are:

[0033] Step 1_1: Select the RGB image and depth image of N original images to form a training set, and mark the RGB image of the kth original image in the training set as The depth map of the original image is marked as The corresponding one-hot encoded label image is denoted as {G k (x, y)}; wherein, k is a positive integer, 1≤k≤N, 1≤x≤W, 1≤y≤H, W represents the width of the original image, and H represents the height of the original image, such as taking W= 640, H=480, R k (x,y) means The pixel value of the pixel whose ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semantic segmentation method based on a residual pyramid pooling neural network. The method comprises the following steps: firstly, constructing a convolutional neural network, wherein a hidden layer of the convolutional neural network comprises 10 residual neural network blocks, 4 Residual ASPP blocks and 5 Basic blocks; inputting the original indoor scene image into a convolutional neural network for training to obtain a corresponding semantic segmentation prediction graph; calculating a loss function value between a set formed by semantic segmentation prediction images corresponding to the original indoor scene images and a set formed by 40 single-heat coded images processed by corresponding real semantic segmentation images to obtain an optimal weight vector and a bias term of a convolutional neural network classification training model; and in a test stage, inputting an indoor scene image to be semantically segmented into the convolutional neural networkclassification training model to obtain a semantic segmentation prediction graph. According to the invention, the semantic segmentation efficiency and accuracy of the indoor scene image are improved.

Description

technical field [0001] The invention is a semantic segmentation method based on a fully convolutional neural network, especially an indoor scene semantic segmentation method for residual pyramid pooling. Background technique [0002] Semantic segmentation is a fundamental technique for many computer vision applications, such as scene understanding, autonomous driving. With the development of Convolutional Neural Networks, especially Fully Convolutional Neural Networks (FCNs), many promising results have been achieved on benchmarks. FCN has a typical encoder-decoder structure—semantic information is first embedded into the feature map through the encoder, and the decoder is responsible for generating segmentation results. Typically, the encoder is a pre-trained convolutional model to extract image features, and the decoder contains multiple upsampling components to recover resolution. Although the encoder's most important feature map may be highly semantic, it has limited a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/10G06N3/04G06N3/08
CPCG06T7/10G06N3/08G06T2207/10024G06T2207/20016G06T2207/20081G06T2207/20084G06N3/045
Inventor 周武杰吕思嘉雷景生何成王海江
Owner ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products