Street scene analysis semantic segmentation method for automatic driving

A technology of semantic segmentation and automatic driving, applied in instruments, biological neural network models, calculations, etc., can solve problems that affect the overall understanding and judgment, and cannot obtain long-distance context information, so as to improve the efficiency of semantic segmentation and reduce the amount of calculation Effect

Active Publication Date: 2022-04-12
SOUTHWEST PETROLEUM UNIV
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the existing network uses a two-dimensional square pooling operator to aggregate local area features to achieve feature pyramids and pyramid pooling modes with consistent size and proportion. This square pooling mode with a pooling aggregation range of a square area can only Aggregating object information in a local area cannot obtain effective long-distance context information
In addition, for some objects with irregular shapes and sizes, such as trees and utility poles, the two-dimensional square pooling operator will inevitably introduce irrelevant noise information, which will affect the network's overall understanding and judgment of features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Street scene analysis semantic segmentation method for automatic driving
  • Street scene analysis semantic segmentation method for automatic driving
  • Street scene analysis semantic segmentation method for automatic driving

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] Such as figure 2 and image 3A street scene parsing semantic segmentation method for automatic driving is shown, comprising the following steps:

[0056] Constructing an image semantic segmentation network, the image semantic segmentation network is used to down-sample the image to obtain an initial feature map, and perform one-dimensional horizontal pooling processing on the initial feature map to obtain a first global feature map, and the initial feature map performing a one-dimensional vertical pooling process on the graph to obtain a second global feature map, fusing the first global feature map and the second global feature map to generate an output image;

[0057] Collecting training pictures, using the training pictures to train the image semantic segmentation network;

[0058] Use the trained image semantic segmentation network to perform semantic segmentation on the image to be processed;

[0059] Wherein, the one-dimensional horizontal pooling process incl...

Embodiment 2

[0067] On the basis of Example 1, such as Figure 4 As shown, the image semantic segmentation network performs pyramid pooling processing on the initial feature map to obtain a local feature map, fuses the local feature map, the first global feature map and the second global feature map, and generates an output image; Pyramid pooling processing includes small pooling layers of at least two scales in parallel to aggregate regional features of corresponding scales, after convolutional layer processing and batch normalization processing of regional features of each scale, activation by activation function, through the above Sampling restores to obtain local feature maps.

[0068] Preferably, as Figure 4 As shown, the pyramid pooling process parallelizes two scales of small pooling layers to aggregate regional features. In one or more embodiments, a two-dimensional conventional convolutional layer (2D Conv) is used to process and extract multi-scale feature information.

Embodiment 3

[0070] On the basis of the above examples, if Figure 5 As shown in Fig. 1, the high-level feature map and low-level feature map are obtained after fusing each feature map, and the high-level feature map and low-level feature map are weighted and added to generate an output image. In this embodiment, after fusing the first, second, and third global feature maps and local feature maps, redistribute the weights of the high-level features in the obtained high-level feature map and the low-level features in the low-level feature map, so that the same channel The weights of high-level and low-level features are not necessarily equal, and then the weighted high-level feature maps and low-level feature maps are added and fused to achieve complementary fusion of high-level and low-level features in terms of semantics and details. Such as Figure 7 As shown, in the FFM module, after adding the weighted high-level feature maps and low-level feature maps, batch normalization is performe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A street scene analysis semantic segmentation method for automatic driving comprises the following steps that an image semantic segmentation network is constructed, the image semantic segmentation network is used for performing down-sampling processing on an image to obtain an initial feature map, and one-dimensional horizontal pooling processing is performed on the initial feature map to obtain a first global feature map; performing one-dimensional vertical pooling processing on the initial feature map to obtain a second global feature map, fusing the first global feature map and the second global feature map, and generating an output image; collecting a training picture, and training the image semantic segmentation network by using the training picture; and performing semantic segmentation on a to-be-processed image by using the trained image semantic segmentation network. The long and narrow pooling mode of a global one-dimensional pooling mechanism is utilized, all information in the horizontal direction and the vertical direction can be directly and effectively aggregated, a large amount of information is associated, effective context information is formed, and the defects of traditional rectangular pooling in long-distance context information aggregation are overcome.

Description

technical field [0001] The invention relates to the field of image semantic segmentation, in particular to a street scene analysis semantic segmentation method for automatic driving. Background technique [0002] Due to the continuous increase of car users, road traffic congestion, safety accidents and other problems are becoming more and more serious. With the support of Internet of Vehicles technology and artificial intelligence technology, autonomous driving technology can coordinate travel routes and planning time, thereby greatly improving travel efficiency and reducing energy consumption to a certain extent. For fast visual tasks such as autonomous driving, both the accuracy and efficiency of image semantic segmentation are very important, but the current semantic segmentation network cannot achieve a good balance between the two. [0003] At present, in order to improve the efficiency of semantic segmentation, a large number of lightweight network researches applied ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06V20/56G06V10/26G06V10/80G06V10/82G06K9/62G06N3/04
Inventor 张强温杰宾万敏鲍海龙廖茁栋唐斌
Owner SOUTHWEST PETROLEUM UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products