Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Real-time high-performance street-view image semantic segmentation method based on deep learning

A technology of semantic segmentation and deep learning, applied in the field of computer vision, which can solve the problem of less research on fast semantic segmentation

Active Publication Date: 2019-08-30
XIAMEN UNIV
View PDF3 Cites 57 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the work of improving computational efficiency mainly focuses on image classification and target tracking, and there are relatively few studies on fast semantic segmentation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time high-performance street-view image semantic segmentation method based on deep learning
  • Real-time high-performance street-view image semantic segmentation method based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The method of the present invention will be described in detail below in conjunction with the accompanying drawings and embodiments. This embodiment is implemented on the premise of the technical solution of the present invention, and provides the implementation mode and specific operation process.

[0026] see figure 1 , the embodiment of the present invention includes the following steps:

[0027] A. Prepare Street View imagery training, validation, and test datasets.

[0028] The dataset used is the famous public dataset Cityscapes, which is a large-scale street view image understanding dataset with pixel-by-pixel semantic annotation, and its annotation contains 30 semantic classes. The dataset consists of 5,000 high-resolution street view images with fine annotations and 20,000 high-resolution images with rough annotations. The resolution of each image is 1024×2048. These images come from 50 different cities in different Shot in seasons and changing scenes. In th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a real-time high-performance street-view image semantic segmentation method based on deep learning. The real-time high-performance street-view image semantic segmentation method includes the steps: preparing a street-view image training, verifying and testing data set; carrying out downsampling on images of the data set to reduce the resolution of the images; transforming an existing lightweight classification network to serve as a basic feature extraction network of semantic segmentation; connecting identification hole space pyramid pooling in series after the basic feature extraction network for solving the multi-scale problem of semantic segmentation; stacking a plurality of convolutional layers to form a shallow spatial information storage network; fusing the obtained feature maps by using a feature fusion network to form a prediction result; comparing the output image with a semantic annotation image in the data set, and performing end-to-end training by using a back propagation algorithm to obtain a real-time high-performance street-view image semantic segmentation network model; and inputting the street-view image to be tested into the real-time high-performance street-view image semantic segmentation network model to obtain a semantic segmentation result of the street-view image.

Description

technical field [0001] The invention relates to computer vision technology, in particular to a method for semantic segmentation of real-time high-performance street view images based on deep learning. Background technique [0002] Semantic segmentation is one of the tasks of scene understanding. It provides detailed pixel-level classification and is a very basic but very challenging task in the field of computer vision. Semantic segmentation can be widely used in various real-world scenarios, such as unmanned driving, robots, or augmented reality, etc. These applications have a strong demand for semantic segmentation algorithms. [0003] Early semantic segmentation used methods based on manual features, such as random forests, etc., but the effects of these methods were not ideal. In recent years, with the continuous development of deep learning technology, deep convolutional neural networks have been widely used in various computer vision tasks, such as image classificatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/08G06T7/10
CPCG06N3/084G06T7/10G06T2207/20081G06T2207/20084G06F18/213G06F18/253G06F18/214
Inventor 严严董根顺王菡子
Owner XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products