A Semantic Segmentation Method for Street Scene Parsing for Autonomous Driving

A semantic segmentation technology for automatic driving, applied in the fields of instruments, biological neural network models, and computing. It addresses the inability of existing networks to obtain long-distance context information, which harms their overall understanding and judgment of features, and has the effect of improving the efficiency of semantic segmentation and reducing the amount of calculation.

Active Publication Date: 2022-07-22

SOUTHWEST PETROLEUM UNIV


AI Technical Summary

Problems solved by technology

[0004] However, existing networks use a two-dimensional square pooling operator to aggregate features of local areas, implementing feature pyramids and pyramid pooling with a fixed size and aspect ratio. Because its aggregation range is a square area, this pooling mode can only aggregate object information within a local area and cannot obtain effective long-distance context information.

In addition, for some objects with irregular shapes and sizes, such as trees and utility poles, the two-dimensional square pooling operator will inevitably introduce irrelevant noise information, which will affect the network's overall understanding and judgment of features.

Examples

Embodiment 1

[0055] As shown in Figures 2 and 3, a semantic segmentation method for street scene parsing for autonomous driving includes the following steps:

[0056] Construct an image semantic segmentation network. The network downsamples the image to obtain an initial feature map, performs one-dimensional horizontal pooling on the initial feature map to obtain a first global feature map, performs one-dimensional vertical pooling on the initial feature map to obtain a second global feature map, and fuses the first and second global feature maps to generate an output image;

[0057] Collect training pictures, and use the training pictures to train the image semantic segmentation network;

[0058] Use the trained image semantic segmentation network to semantically segment the image to be processed;

[0059] Wherein, the one-dime...
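
The two one-dimensional pooling branches described in [0056] can be sketched in a few lines. This is a minimal, dependency-free illustration on a single-channel feature map, not the patented network. It assumes that horizontal pooling averages each row across the full width and that vertical pooling averages each column across the full height (the definition in [0059] is truncated in this text), and that fusion is an element-wise sum after both results are broadcast back to the original size. All function names are hypothetical.

```python
from typing import List

FeatureMap = List[List[float]]  # single-channel map, indexed [row][col]


def horizontal_pool(fmap: FeatureMap) -> List[float]:
    """Average each row over the full width (assumed meaning of 1-D horizontal pooling)."""
    return [sum(row) / len(row) for row in fmap]


def vertical_pool(fmap: FeatureMap) -> List[float]:
    """Average each column over the full height (assumed meaning of 1-D vertical pooling)."""
    height = len(fmap)
    return [sum(col) / height for col in zip(*fmap)]


def fuse_global(fmap: FeatureMap) -> FeatureMap:
    """Broadcast both pooled vectors back to H x W and add them (assumed fusion rule)."""
    row_means = horizontal_pool(fmap)  # one value per row
    col_means = vertical_pool(fmap)    # one value per column
    return [[r + c for c in col_means] for r in row_means]


fmap = [[1.0, 2.0, 3.0],
        [4.0, 5.0, 6.0]]
fused = fuse_global(fmap)
```

Each output position combines the mean of its whole row with the mean of its whole column, so it sees information from the far ends of the map in a single step, which a small square pooling window cannot do.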

Embodiment 2

[0067] On the basis of Embodiment 1, as shown in Figure 4, the image semantic segmentation network performs pyramid pooling on the initial feature map to obtain a local feature map, and fuses the local feature map, the first global feature map, and the second global feature map to generate an output image. The pyramid pooling process runs small pooling layers of at least two scales in parallel to aggregate the regional features at the corresponding scales, followed by upsampling and restoration to obtain the local feature map.

[0068] Preferably, as shown in Figure 4, the pyramid pooling process runs small pooling layers of two scales in parallel to aggregate regional features. In one or more embodiments, a conventional two-dimensional convolutional layer (2D Conv) is used to process and extract multi-scale feature information.
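
The parallel small pooling layers of Embodiment 2 can be illustrated as follows. This is a sketch under stated assumptions rather than the network in Figure 4: it uses non-overlapping average pooling, the hypothetical scales 2 and 4, nearest-neighbour upsampling for the restoration step, and an element-wise sum to merge the branches. The 2D Conv stage is omitted, and all function names are hypothetical.

```python
from typing import List, Sequence

FeatureMap = List[List[float]]  # single-channel map, indexed [row][col]


def avg_pool(fmap: FeatureMap, k: int) -> FeatureMap:
    """Non-overlapping k x k average pooling; assumes H and W are divisible by k."""
    h, w = len(fmap), len(fmap[0])
    return [[sum(fmap[i + di][j + dj] for di in range(k) for dj in range(k)) / (k * k)
             for j in range(0, w, k)]
            for i in range(0, h, k)]


def upsample_nearest(fmap: FeatureMap, k: int) -> FeatureMap:
    """Restore the pooled map to its pre-pooling size by repeating each value k x k times."""
    h, w = len(fmap) * k, len(fmap[0]) * k
    return [[fmap[i // k][j // k] for j in range(w)] for i in range(h)]


def local_features(fmap: FeatureMap, scales: Sequence[int] = (2, 4)) -> FeatureMap:
    """Pool at each scale, restore each branch to H x W, and sum the branches."""
    h, w = len(fmap), len(fmap[0])
    out = [[0.0] * w for _ in range(h)]
    for k in scales:
        restored = upsample_nearest(avg_pool(fmap, k), k)
        for i in range(h):
            for j in range(w):
                out[i][j] += restored[i][j]
    return out


fmap = [[float(4 * i + j) for j in range(4)] for i in range(4)]  # values 0..15
local = local_features(fmap)
```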

Embodiment 3

[0070] On the basis of the above embodiments, as shown in Figure 5, a high-level feature map and a low-level feature map are obtained after the feature maps are fused, and the two are weighted and added to generate the output image. In this embodiment, after the first, second, and third global feature maps and the local feature maps are fused, weights are redistributed to the high-level features in the resulting high-level feature map and to the low-level features in the low-level feature map, so that within the same channel the weights of the high-level and low-level features are not necessarily equal. The weighted high-level and low-level feature maps are then added and fused, so that high-level and low-level features complement each other in semantics and detail. As shown in Figure 7, in the FFM module, after the weighted high-level feature map and low-level feature map are a...
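
The weighted addition in Embodiment 3 can be sketched as a per-channel weighted sum. How the weights are computed is not recoverable from this truncated text, so the sketch simply takes them as inputs. The function name, the [channel][row][col] layout, and the example weights are all hypothetical.

```python
from typing import List

Tensor3 = List[List[List[float]]]  # indexed [channel][row][col]


def weighted_fuse(high: Tensor3, low: Tensor3,
                  w_high: List[float], w_low: List[float]) -> Tensor3:
    """Scale each channel of both maps by its own weight, then add element-wise."""
    return [[[wh * h + wl * l for h, l in zip(h_row, l_row)]
             for h_row, l_row in zip(h_ch, l_ch)]
            for h_ch, l_ch, wh, wl in zip(high, low, w_high, w_low)]


high = [[[1.0, 2.0]], [[3.0, 4.0]]]      # 2 channels, 1 x 2 each
low = [[[10.0, 20.0]], [[30.0, 40.0]]]
# The weights for the same channel need not be equal between the two maps.
fused = weighted_fuse(high, low, w_high=[0.5, 0.25], w_low=[0.5, 0.75])
```

Channel 0 weights both maps equally, while channel 1 favours the low-level map, which mirrors the statement that the weights within one channel are not necessarily equal.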

Abstract

A semantic segmentation method for street scene parsing for automatic driving, comprising the following steps: constructing an image semantic segmentation network, which downsamples an image to obtain an initial feature map, performs one-dimensional horizontal pooling on the initial feature map to obtain a first global feature map, performs one-dimensional vertical pooling on the initial feature map to obtain a second global feature map, and fuses the first and second global feature maps to generate an output image; collecting training images and using them to train the image semantic segmentation network; and using the trained network to perform semantic segmentation on the image to be processed. The invention uses the long, narrow pooling windows of a global one-dimensional pooling mechanism, which directly and effectively aggregate all the information along the horizontal and vertical directions and link it into effective context information, making up for the shortcomings of traditional rectangular pooling in aggregating long-distance context information.

Description

Technical field

[0001] The invention relates to the field of image semantic segmentation, in particular to a semantic segmentation method for street scene parsing for automatic driving.

Background

[0002] As the number of car users grows, road traffic congestion and safety accidents have become increasingly serious problems. With the support of vehicle networking and artificial intelligence technology, autonomous driving can coordinate travel routes and plan travel time, thereby greatly improving travel efficiency and reducing energy consumption to a certain extent. For fast visual tasks such as automatic driving, both the accuracy and the efficiency of image semantic segmentation are very important, but current semantic segmentation networks cannot achieve a good balance between the two.

[0003] At present, in order to improve the efficiency of semantic segmentation, a large body of research on lightweight networks applied to real-time...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G06V20/56, G06V10/26, G06V10/80, G06V10/82, G06K9/62, G06N3/04

Inventor: 张强, 温杰宾, 万敏, 鲍海龙, 廖茁栋, 唐斌

Owner: SOUTHWEST PETROLEUM UNIV
