Real-time semantic segmentation system and method based on deep learning and weight distribution
A technology of weight distribution and semantic segmentation, which is applied in neural learning methods, character and pattern recognition, biological neural network models, etc., can solve the problems of unable to extract feature information and few network layers, and achieve the effect of simple structure and easy implementation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0091] Embodiment: a kind of real-time semantic segmentation system based on deep learning and weight distribution, such as figure 1 As shown, it is characterized in that it includes the following modules: data acquisition module, data preprocessing module, encoding module, decoding module, weight distribution module and semantic segmentation prediction module; wherein, the data acquisition module collects the input image signal, and Output it to the input end of the data preprocessing module; the input end of the encoding module receives the processed image signal sent by the output end of the data preprocessing module, and the output end outputs the feature map signal, and outputs it to the input end of the decoding module ; The input end of the decoding module receives the feature map signal output by the output end of the encoding module, and outputs it to the weight distribution module or the semantic segmentation prediction module; the input end of the weight distribution...
specific Embodiment approach
[0162] This embodiment utilizes Python3 language and frameworks such as PyTorch1.5 to build a real-time semantic segmentation method based on deep learning and weight distribution. The main goal of segmentation is the segmentation accuracy, speed, and parameter amount for each category in the image. The specific implementation is as follows:
[0163] Data acquisition module: from https: / / www.cityscapes-dataset.com / Get the cityscapes dataset.
[0164] Data preprocessing module: This module performs data enhancement on the input image, including methods such as horizontal flip, vertical flip, cropping, and zooming in. like Figure 7-b As shown, the normalization operation is performed on the input image, and the pixels in the range of 0-255 are converted into pixels in the range of 0-1, so as to speed up the learning speed of the network, so that the mean value of all input samples is close to 0 or its mean square error Small in comparison. Finally, output a 512*1024 pixel...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


