Street scene analysis semantic segmentation method for automatic driving
A technology of semantic segmentation and automatic driving, applied in instruments, biological neural network models, calculations, etc., can solve problems that affect the overall understanding and judgment, and cannot obtain long-distance context information, so as to improve the efficiency of semantic segmentation and reduce the amount of calculation Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0055] Such as figure 2 and image 3A street scene parsing semantic segmentation method for automatic driving is shown, comprising the following steps:
[0056] Constructing an image semantic segmentation network, the image semantic segmentation network is used to down-sample the image to obtain an initial feature map, and perform one-dimensional horizontal pooling processing on the initial feature map to obtain a first global feature map, and the initial feature map performing a one-dimensional vertical pooling process on the graph to obtain a second global feature map, fusing the first global feature map and the second global feature map to generate an output image;
[0057] Collecting training pictures, using the training pictures to train the image semantic segmentation network;
[0058] Use the trained image semantic segmentation network to perform semantic segmentation on the image to be processed;
[0059] Wherein, the one-dimensional horizontal pooling process incl...
Embodiment 2
[0067] On the basis of Example 1, such as Figure 4 As shown, the image semantic segmentation network performs pyramid pooling processing on the initial feature map to obtain a local feature map, fuses the local feature map, the first global feature map and the second global feature map, and generates an output image; Pyramid pooling processing includes small pooling layers of at least two scales in parallel to aggregate regional features of corresponding scales, after convolutional layer processing and batch normalization processing of regional features of each scale, activation by activation function, through the above Sampling restores to obtain local feature maps.
[0068] Preferably, as Figure 4 As shown, the pyramid pooling process parallelizes two scales of small pooling layers to aggregate regional features. In one or more embodiments, a two-dimensional conventional convolutional layer (2D Conv) is used to process and extract multi-scale feature information.
Embodiment 3
[0070] On the basis of the above examples, if Figure 5 As shown in Fig. 1, the high-level feature map and low-level feature map are obtained after fusing each feature map, and the high-level feature map and low-level feature map are weighted and added to generate an output image. In this embodiment, after fusing the first, second, and third global feature maps and local feature maps, redistribute the weights of the high-level features in the obtained high-level feature map and the low-level features in the low-level feature map, so that the same channel The weights of high-level and low-level features are not necessarily equal, and then the weighted high-level feature maps and low-level feature maps are added and fused to achieve complementary fusion of high-level and low-level features in terms of semantics and details. Such as Figure 7 As shown, in the FFM module, after adding the weighted high-level feature maps and low-level feature maps, batch normalization is performe...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com