A Semantic Segmentation Method for Street Scene Parsing for Autonomous Driving

A semantic segmentation technology for autonomous driving, applied in the fields of instruments, biological neural network models, and computing. It addresses the inability of existing networks to obtain long-distance context information, which affects overall understanding and judgment, with the effect of improving the efficiency of semantic segmentation and reducing the amount of computation.

Active Publication Date: 2022-07-22
SOUTHWEST PETROLEUM UNIV

AI Technical Summary

Problems solved by technology

[0004] However, existing networks use a two-dimensional square pooling operator to aggregate local-area features, building feature pyramids and pyramid pooling modules with fixed sizes and proportions. Because its aggregation range is a square area, this pooling mode can only aggregate object information within a local region and cannot capture effective long-distance context information.
In addition, for objects with irregular shapes and sizes, such as trees and utility poles, the two-dimensional square pooling operator inevitably introduces irrelevant noise, which impairs the network's overall understanding and judgment of features.
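The noise argument can be made concrete with a toy example. The sketch below is not taken from the patent: the 5 x 5 map, the 3 x 3 window, and the helper names `square_pool_at` and `column_strip_pool` are illustrative assumptions. It compares how much of a thin vertical "pole" survives a square average versus a full-height one-pixel-wide strip average.

```python
def square_pool_at(fmap, row, col, k=3):
    """Average over the k x k window centred on (row, col), clipped to the map."""
    half = k // 2
    rows = range(max(0, row - half), min(len(fmap), row + half + 1))
    cols = range(max(0, col - half), min(len(fmap[0]), col + half + 1))
    vals = [fmap[r][c] for r in rows for c in cols]
    return sum(vals) / len(vals)


def column_strip_pool(fmap, col):
    """Average over the full-height, one-pixel-wide strip at column `col`."""
    return sum(row[col] for row in fmap) / len(fmap)


# Toy 5 x 5 response map (hypothetical): a thin "pole" of ones in column 2.
pole = [[1.0 if c == 2 else 0.0 for c in range(5)] for r in range(5)]

square = square_pool_at(pole, 2, 2)  # 3 pole pixels diluted by 6 background pixels
strip = column_strip_pool(pole, 2)   # 5 pole pixels, no background pixels
print(square, strip)
```

In this toy case the square window mixes six background pixels into the pole response, while the vertical strip sees only the pole along its whole height.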


Examples

Embodiment 1

[0055] As shown in Figure 2 and Figure 3, a semantic segmentation method for street scene parsing for autonomous driving includes the following steps:

[0056] Construct an image semantic segmentation network. The network downsamples the image to obtain an initial feature map, performs one-dimensional horizontal pooling on the initial feature map to obtain a first global feature map, performs one-dimensional vertical pooling on the initial feature map to obtain a second global feature map, and fuses the first global feature map and the second global feature map to generate an output image;

[0057] Collect training pictures, and use the training pictures to train the image semantic segmentation network;

[0058] Use the trained image semantic segmentation network to semantically segment the image to be processed;

[0059] Wherein, the one-dime...
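The two one-dimensional pooling branches and their fusion can be sketched as follows. This is a minimal, dependency-free illustration under stated assumptions, not the patented network: it assumes average pooling, treats "horizontal" as averaging each row across the full width and "vertical" as averaging each column across the full height, and fuses by broadcasting both summaries back to the original size and adding them. The function names are hypothetical, and the learned layers a real network would include are omitted.

```python
def horizontal_pool(fmap):
    """1-D horizontal pooling: average each row over the full width (H x W -> H values)."""
    return [sum(row) / len(row) for row in fmap]


def vertical_pool(fmap):
    """1-D vertical pooling: average each column over the full height (H x W -> W values)."""
    height = len(fmap)
    return [sum(col) / height for col in zip(*fmap)]


def fuse_global(fmap):
    """Broadcast both strip summaries back to H x W and add them (assumed fusion rule)."""
    row_ctx = horizontal_pool(fmap)
    col_ctx = vertical_pool(fmap)
    return [[row_ctx[r] + col_ctx[c] for c in range(len(col_ctx))]
            for r in range(len(row_ctx))]


# Hypothetical 2 x 3 initial feature map (single channel).
initial = [[1.0, 2.0, 3.0],
           [4.0, 5.0, 6.0]]
fused = fuse_global(initial)
print(fused)
```

Each output position combines the context of its entire row and its entire column, which is how strip-shaped pooling reaches long-distance information that a small square window cannot.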

Embodiment 2

[0067] On the basis of Embodiment 1, as shown in Figure 4, the image semantic segmentation network also performs pyramid pooling on the initial feature map to obtain a local feature map, then fuses the local feature map, the first global feature map and the second global feature map to generate the output image. The pyramid pooling process runs small pooling layers of at least two scales in parallel to aggregate regional features at the corresponding scales, and then restores the results by sampling to obtain the local feature map.

[0068] Preferably, as shown in Figure 4, the pyramid pooling process runs small pooling layers of two scales in parallel to aggregate regional features. In one or more embodiments, a conventional two-dimensional convolutional layer (2D Conv) is used to process and extract multi-scale feature information.
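A two-scale pyramid pooling branch of this kind can be sketched as below. The details are assumptions for illustration only: average pooling into a `bins x bins` grid, nearest-neighbour replication to restore the size, and summation across branches. The source does not specify the scales, the restoration method, or the fusion rule, and the 2D Conv step is left out. All function names are hypothetical.

```python
def adaptive_avg_pool(fmap, bins):
    """Average-pool an H x W map into a bins x bins grid (H and W divisible by bins)."""
    h, w = len(fmap), len(fmap[0])
    bh, bw = h // bins, w // bins
    return [[sum(fmap[r][c]
                 for r in range(i * bh, (i + 1) * bh)
                 for c in range(j * bw, (j + 1) * bw)) / (bh * bw)
             for j in range(bins)]
            for i in range(bins)]


def upsample_nearest(grid, h, w):
    """Restore a pooled grid to H x W by nearest-neighbour replication."""
    bins = len(grid)
    return [[grid[r * bins // h][c * bins // w] for c in range(w)]
            for r in range(h)]


def local_feature_map(fmap, scales=(2, 4)):
    """Run one pooling branch per scale in parallel and sum the restored maps."""
    h, w = len(fmap), len(fmap[0])
    branches = [upsample_nearest(adaptive_avg_pool(fmap, s), h, w) for s in scales]
    return [[sum(b[r][c] for b in branches) for c in range(w)] for r in range(h)]


# Hypothetical 4 x 4 initial feature map holding the values 0..15.
initial = [[float(4 * r + c) for c in range(4)] for r in range(4)]
local = local_feature_map(initial)
print(local[0][0], local[3][3])
```

Each branch summarises regions of a different size, so the summed map carries regional context at both scales for every pixel.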

Embodiment 3

[0070] On the basis of the above embodiments, as shown in Figure 5, a high-level feature map and a low-level feature map are obtained after the feature maps are fused, and the two are weighted and added to generate the output image. In this embodiment, after the first, second, and third global feature maps and the local feature maps are fused, weights are redistributed to the high-level features in the resulting high-level feature map and the low-level features in the low-level feature map, so that within the same channel the weights of the high-level and low-level features are not necessarily equal. The weighted high-level and low-level feature maps are then added and fused, so that high-level and low-level features complement each other in semantics and detail. As shown in Figure 7, in the FFM module, after the weighted high-level feature map and low-level feature map are a...
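The weighted addition of high-level and low-level feature maps can be sketched as follows. This is an illustration under assumptions, not the FFM module itself: the per-channel weights are passed in directly, whereas the source does not say here how they are computed, and `weighted_fuse` and the sample values are hypothetical.

```python
def weighted_fuse(high, low, w_high, w_low):
    """Per-channel weighted addition of high-level and low-level feature maps.

    high, low: lists of channels, each an H x W nested list of the same shape.
    w_high, w_low: one scalar weight per channel; the two need not be equal.
    """
    if not (len(high) == len(low) == len(w_high) == len(w_low)):
        raise ValueError("channel counts must match")
    return [[[wh * h + wl * l for h, l in zip(h_row, l_row)]
             for h_row, l_row in zip(h_ch, l_ch)]
            for h_ch, l_ch, wh, wl in zip(high, low, w_high, w_low)]


# Two channels of a hypothetical 1 x 2 feature map.
high = [[[1.0, 2.0]], [[3.0, 4.0]]]
low = [[[10.0, 20.0]], [[30.0, 40.0]]]

# Channel 0 weights both levels equally; channel 1 favours the low-level detail.
out = weighted_fuse(high, low, w_high=[0.5, 0.25], w_low=[0.5, 0.75])
print(out)
```

Because each channel has its own pair of weights, one channel can lean on high-level semantics while another leans on low-level detail.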

Abstract

A semantic segmentation method for street scene parsing for autonomous driving, comprising the following steps: constructing an image semantic segmentation network that downsamples an image to obtain an initial feature map, performs one-dimensional horizontal pooling on the initial feature map to obtain a first global feature map, performs one-dimensional vertical pooling on the initial feature map to obtain a second global feature map, and fuses the first and second global feature maps to generate an output image; collecting training images and using them to train the image semantic segmentation network; and using the trained network to semantically segment the image to be processed. The invention uses the long, narrow pooling window of a global one-dimensional pooling mechanism, which directly and effectively aggregates all information along the horizontal and vertical directions and links it into effective context information, making up for the shortcomings of traditional rectangular pooling in aggregating long-distance context information.

Description

Technical field

[0001] The invention relates to the field of image semantic segmentation, in particular to a semantic segmentation method for street scene parsing for autonomous driving.

Background

[0002] As the number of car users grows, road congestion and traffic accidents have become increasingly serious. Supported by vehicle networking and artificial intelligence, autonomous driving technology can coordinate travel routes and plan travel times, greatly improving travel efficiency and reducing energy consumption to a certain extent. For time-critical visual tasks such as autonomous driving, both the accuracy and the efficiency of image semantic segmentation are very important, but current semantic segmentation networks cannot achieve a good balance between the two.

[0003] At present, in order to improve the efficiency of semantic segmentation, a large body of research on lightweight networks applied to real-time...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06V20/56, G06V10/26, G06V10/80, G06V10/82, G06K9/62, G06N3/04
Inventor: 张强, 温杰宾, 万敏, 鲍海龙, 廖茁栋, 唐斌
Owner: SOUTHWEST PETROLEUM UNIV