Street scene analysis semantic segmentation method for automatic driving

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of semantic segmentation and automatic driving, applied in instruments, biological neural network models, calculations, etc., can solve problems that affect the overall understanding and judgment, and cannot obtain long-distance context information, so as to improve the efficiency of semantic segmentation and reduce the amount of calculation Effect

Active Publication Date: 2022-04-12

SOUTHWEST PETROLEUM UNIV

View PDF10 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] However, the existing network uses a two-dimensional square pooling operator to aggregate local area features to achieve feature pyramids and pyramid pooling modes with consistent size and proportion. This square pooling mode with a pooling aggregation range of a square area can only Aggregating object information in a local area cannot obtain effective long-distance context information

In addition, for some objects with irregular shapes and sizes, such as trees and utility poles, the two-dimensional square pooling operator will inevitably introduce irrelevant noise information, which will affect the network's overall understanding and judgment of features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0055] Such as figure 2 and image 3A street scene parsing semantic segmentation method for automatic driving is shown, comprising the following steps:

[0056] Constructing an image semantic segmentation network, the image semantic segmentation network is used to down-sample the image to obtain an initial feature map, and perform one-dimensional horizontal pooling processing on the initial feature map to obtain a first global feature map, and the initial feature map performing a one-dimensional vertical pooling process on the graph to obtain a second global feature map, fusing the first global feature map and the second global feature map to generate an output image;

[0057] Collecting training pictures, using the training pictures to train the image semantic segmentation network;

[0058] Use the trained image semantic segmentation network to perform semantic segmentation on the image to be processed;

[0059] Wherein, the one-dimensional horizontal pooling process incl...

Embodiment 2

[0067] On the basis of Example 1, such as Figure 4 As shown, the image semantic segmentation network performs pyramid pooling processing on the initial feature map to obtain a local feature map, fuses the local feature map, the first global feature map and the second global feature map, and generates an output image; Pyramid pooling processing includes small pooling layers of at least two scales in parallel to aggregate regional features of corresponding scales, after convolutional layer processing and batch normalization processing of regional features of each scale, activation by activation function, through the above Sampling restores to obtain local feature maps.

[0068] Preferably, as Figure 4 As shown, the pyramid pooling process parallelizes two scales of small pooling layers to aggregate regional features. In one or more embodiments, a two-dimensional conventional convolutional layer (2D Conv) is used to process and extract multi-scale feature information.

Embodiment 3

[0070] On the basis of the above examples, if Figure 5 As shown in Fig. 1, the high-level feature map and low-level feature map are obtained after fusing each feature map, and the high-level feature map and low-level feature map are weighted and added to generate an output image. In this embodiment, after fusing the first, second, and third global feature maps and local feature maps, redistribute the weights of the high-level features in the obtained high-level feature map and the low-level features in the low-level feature map, so that the same channel The weights of high-level and low-level features are not necessarily equal, and then the weighted high-level feature maps and low-level feature maps are added and fused to achieve complementary fusion of high-level and low-level features in terms of semantics and details. Such as Figure 7 As shown, in the FFM module, after adding the weighted high-level feature maps and low-level feature maps, batch normalization is performe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A street scene analysis semantic segmentation method for automatic driving comprises the following steps that an image semantic segmentation network is constructed, the image semantic segmentation network is used for performing down-sampling processing on an image to obtain an initial feature map, and one-dimensional horizontal pooling processing is performed on the initial feature map to obtain a first global feature map; performing one-dimensional vertical pooling processing on the initial feature map to obtain a second global feature map, fusing the first global feature map and the second global feature map, and generating an output image; collecting a training picture, and training the image semantic segmentation network by using the training picture; and performing semantic segmentation on a to-be-processed image by using the trained image semantic segmentation network. The long and narrow pooling mode of a global one-dimensional pooling mechanism is utilized, all information in the horizontal direction and the vertical direction can be directly and effectively aggregated, a large amount of information is associated, effective context information is formed, and the defects of traditional rectangular pooling in long-distance context information aggregation are overcome.

Description

technical field [0001] The invention relates to the field of image semantic segmentation, in particular to a street scene analysis semantic segmentation method for automatic driving. Background technique [0002] Due to the continuous increase of car users, road traffic congestion, safety accidents and other problems are becoming more and more serious. With the support of Internet of Vehicles technology and artificial intelligence technology, autonomous driving technology can coordinate travel routes and planning time, thereby greatly improving travel efficiency and reducing energy consumption to a certain extent. For fast visual tasks such as autonomous driving, both the accuracy and efficiency of image semantic segmentation are very important, but the current semantic segmentation network cannot achieve a good balance between the two. [0003] At present, in order to improve the efficiency of semantic segmentation, a large number of lightweight network researches applied ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06V20/56G06V10/26G06V10/80G06V10/82G06K9/62G06N3/04

Inventor 张强温杰宾万敏鲍海龙廖茁栋唐斌

Owner SOUTHWEST PETROLEUM UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Street scene analysis semantic segmentation method for automatic driving

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology