Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Weakly Supervised Image Semantic Segmentation Method Based on Spatial Pyramid Mask Pooling

A space pyramid and semantic segmentation technology, applied in the field of computer vision, to reduce the risk of training failure, rich in local features, and universal

Active Publication Date: 2021-09-21
成都图必优科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the other hand, although the pyramid model has the characteristics of multi-scale information and local information fusion, the problem of how to better mine the sub-regional semantic information on the basis of mastering the global information has not been completely solved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Weakly Supervised Image Semantic Segmentation Method Based on Spatial Pyramid Mask Pooling
  • A Weakly Supervised Image Semantic Segmentation Method Based on Spatial Pyramid Mask Pooling
  • A Weakly Supervised Image Semantic Segmentation Method Based on Spatial Pyramid Mask Pooling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Refer to attached Figure 1-4 , the embodiments of the present invention will be described in detail.

[0058] A weakly supervised image semantic segmentation method based on spatial pyramid mask pooling, comprising the following steps:

[0059] Step 1: Select a convolutional neural network H, and process the input image X through the convolutional neural network H to obtain a classification feature map;

[0060] Step 2: Establish a spatial pyramid pooling module based on the classification feature map, and then perform spatial pyramid masking to obtain an output feature map;

[0061] Step 3: Calculate the category activation vector and category probability vector according to the output feature map, and then establish a competitive spatial pyramid masking pooling loss function;

[0062] Step 4: Train the convolutional neural network H according to the competitive spatial pyramid masking pooling loss function and extract segmentation feature maps.

[0063] Further, t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a weakly supervised image semantic segmentation method based on spatial pyramid mask pooling, comprising the following steps: selecting a convolutional neural network H, processing an input image X through the convolutional neural network H, and obtaining a classification feature map; The classification feature map establishes the spatial pyramid pooling module, and then performs spatial pyramid masking to obtain the output feature map; calculates the category activation vector and category probability vector according to the output feature map, and then establishes a competitive spatial pyramid to cover the pooling loss function; according to the competitive space The pyramid masking pooling loss function trains a convolutional neural network H and extracts segmentation feature maps. The invention realizes a weakly supervised image semantic segmentation model with richer local features, more perfect regional feature mining, and more robust target size and attitude, improves the extraction ability of local semantic information, and strengthens local targets or parts in weakly supervised semantic segmentation recognition ability.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and in particular relates to a method for semantic segmentation of weakly supervised images based on spatial pyramid masking pooling. Background technique [0002] Image semantic segmentation is a basic computer vision task, and its goal is to classify all pixels in the image. Since it can fully understand images at the pixel level, it is useful for other vision tasks such as image classification and object recognition. However, because the production of pixel-level label data requires a lot of energy, fully supervised image semantic segmentation is difficult to quickly achieve large-scale expansion, so weakly supervised image semantic segmentation methods that rely on image category labels have been widely studied. [0003] In past computer vision research, the pyramid model has been used a lot. The well-known SIFT algorithm extracts key point description features through the Laplacian...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06T7/11G06K9/62G06N3/04
CPCG06T7/11G06T2207/20016G06N3/045G06F18/2415G06F18/214
Inventor 朱策段昶文宏雕徐榕健
Owner 成都图必优科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products