Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An image semantic segmentation method based on deep learning

A semantic segmentation and deep learning technology, applied in the field of computer vision and deep learning, can solve the problems of insufficient use of image local feature information and global context dependencies, rough edges of image segmentation, and missing local features, etc., to improve the generalization ability. , adding richness and precise border effects

Active Publication Date: 2019-05-03
SHAANXI NORMAL UNIV
View PDF14 Cites 114 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Disadvantages of the above semantic segmentation methods: first, the model input is generally an RGB image, and the input is too single, which may lead to the loss of local features; second, these methods are based on convolutional neural networks for feature extraction, and do not make full use of images. The local feature information and global context dependence of the image lead to very rough segmentation edges of the image and very low segmentation accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An image semantic segmentation method based on deep learning
  • An image semantic segmentation method based on deep learning
  • An image semantic segmentation method based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0042] The WeizmannHorse dataset is an image segmentation dataset consisting of 328 images. Some images in the dataset are as follows: image 3 As shown, the training of the network model uses the Pytorch platform, and the code is written on python. This embodiment is based on the image semantic segmentation method of deep learning, such as figure 1 As shown, the steps are as follows:

[0043] S1. Data set processing

[0044] Randomly select 200 images from the WeizmannHorse dataset as a training image set, and the remaining 128 images as a test image set, and perform data enhancement operations on the training image set to increase the number of training images to 11,000;

[0045] S2. Build a deep semantic segmentation network

[0046] The deep semantic segmentation network consists of a parallel deep neural network module, a feature fusion module, and a Softmax classification layer. The parallel deep neural network module is used to extract features from the input image, a...

Embodiment 2

[0077] The StanfordBackground dataset is an image segmentation dataset consisting of 715 images. Some images in the dataset are as follows: Figure 4 As shown, the training of the network model uses the Pytorch platform, and the code is written in python.

[0078]This embodiment is based on the image semantic segmentation method of deep learning. In step S1, 573 images are randomly selected from the StanfordBackground data set as a training image set, and the remaining 142 images are used as a test image set, and the data enhancement operation is performed on the training image set. The number of training images is increased to 13,752; in step S32, the size of the image in the training image set after data enhancement is cut to 421×421, and the training image is used to train the deep semantic segmentation network to generate a pixel category prediction label probability distribution map, The prediction loss is calculated using the predicted label probability and the original ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an image semantic segmentation method based on deep learning. The method comprises four parts of data set processing, deep semantic segmentation network construction, deep semantic segmentation network training and parameter learning, and semantic segmentation on a test image. The RGB image and the gray level image of the input image are used as the input of the network model, the edge information of the gray level image is fully utilized, and the richness degree of input characteristics is effectively increased; a convolutional neural network and a bidirectional threshold recursion unit are combined, and more context dependency relationships and global feature information are captured on the basis of learning image local features; coordinate information is added tothe feature map through the first coordinate channel module and the second coordinate channel module, the coordinate features of the model are enriched, the generalization ability of the model is improved, and a semantic segmentation result with high resolution and accurate boundary is generated.

Description

technical field [0001] The invention belongs to the technical field of computer vision and deep learning, and in particular relates to an image semantic segmentation method based on deep learning. Background technique [0002] Image semantic segmentation is to understand and recognize the content of pictures from the pixel level. Its purpose is to establish a one-to-one mapping relationship between each pixel and semantic categories, and perform segmentation based on semantic information. It is widely used in scene understanding and automatic driving. , medical image analysis, robot vision and other fields. Image semantic segmentation is the cornerstone of image understanding, and the quality of its segmentation results will directly affect the processing of subsequent image content. Therefore, the research on image semantic segmentation technology has very important practical significance. [0003] Most of the traditional image semantic segmentation methods rely on manual ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34G06K9/62G06N3/04G06N3/08
CPCY02T10/40
Inventor 郭敏丁晓马苗陈昱莅裴炤
Owner SHAANXI NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products