Semi-supervised image semantic segmentation method and device based on self-supervised low-rank network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A semantic segmentation and semi-supervised technology, applied in image analysis, image enhancement, image data processing, etc., can solve the problems of gradient flow backpropagation, increasing error imitation and accumulation, etc., to enhance consistency, solve ambiguity or The effect of misprediction

Active Publication Date: 2021-08-06

TIANJIN UNIV +1

View PDF8 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, this also increases the risk of error being imitated and accumulated, and there will be a phenomenon of gradient flow backpropagating from the top layer to the lower layer

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0099] To evaluate the effectiveness of SLRNet, extensive experiments are conducted on the Pascal VOC 2012 dataset, which is a widely used WSSS evaluation benchmark. In addition, an enhanced training set is constructed by adding annotations. There are a total of 10582 images in the dataset for training and 1449 images for validation.

[0100] 1. Model selection and use

[0101] 1. Cross-view self-supervision framework

[0102] A cross-view self-supervision framework is proposed, and by combining the LR low-rank module, the compounding effect caused by the self-supervision error of the single-level WSSS model is effectively alleviated. SLRNet simultaneously predicts multiple segmentation templates for multiple augmented views of an image, and then merges to generate accurate pseudo-labels as cross-view self-supervision. The supervision of cross-views helps to utilize the supplementary information from various augmented views to strengthen the consistency of predictions.

[...

Embodiment 2

[0123] In order to verify the superiority of the method and the effectiveness of each module, a large number of ablation experiments were carried out in the embodiment of the present invention, as described below for details:

[0124] To understand the impact of individual data augmentation on weakly supervised segmentation, several geometric and appearance augmentation modalities are considered. In addition, more attention is paid to reversible and differentiable geometric transformations, such as scaling and flipping, etc.

[0125] First, randomly crop the image to a size of 321×321. Then, apply target transformations to different branches. The composition of three transformations is studied: fixed-rate rescaling, random horizontal flipping, and random color distortions (such as brightness, contrast, saturation, and hue). Under supervised settings, strong color distortion does not improve or even hurt performance. So, for Brightness, Contrast, and Saturation, set the Maxi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to view more

PUM

Login to view more

Abstract

The invention discloses a semi-supervised image semantic segmentation method and device based on a self-supervised low-rank network, and the method comprises the steps that: the self-supervised low-rank network is constructed, the inverse geometric transformations are performed masks from two branches, a pseudo mask is generated through an optimization module, and the pseudo mask is input into an LR low-rank module; in each iteration, an assignment matrix P is calculated through softmax normalization attention and a temperature coefficient; the optimal basis mu is updated by aggregating the input feature X, and after a softmax normalized class activation graph A with the class being C and a deep feature X1 are obtained, the kth initialization basis is calculated through a weighted average value; and, in the base initialization process, a target function composed of classification loss and pseudo mask segmentation loss is used for supervision, an output result of an LR low-rank module is decoded and optimized, and the self-supervision low-rank network is updated according to the loss. The device comprises a construction module, an optimization module, an LR low-rank module, an updating module, a prediction module, a supervision module and an output module.

Description

technical field [0001] The invention relates to the field of image semantic segmentation, in particular to a semi-supervised image semantic segmentation method and device based on a self-supervised low-rank network. Background technique [0002] Recently, deep learning-based semantic segmentation models have achieved significant progress by training with large-scale pixel-level labels. However, this supervised approach requires extensive human annotation, which is time-consuming and expensive. In order to reduce the workload of annotating pixel-level labels, a large number of studies have developed Weakly Supervised Semantic Segmentation methods (WSSS) with low-cost annotations, such as: bounding boxes, scribbles, points, and image-level labels. [0003] Most of the popular image-level WSSS methods need to go through multiple training and optimization stages to obtain more accurate pseudo-labels. These methods usually start with weakly supervised localization, e.g. Class A...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to view more

Application Information

Patent Timeline

Login to view more

Patent Type & Authority Applications(China)

IPC IPC(8): G06T7/10G06N3/04G06N3/08G06K9/62

CPCG06T7/10G06N3/08G06T2207/10004G06N3/045G06F18/23

Inventor 朱鹏飞潘俊文徐玮毅王汉石赵帅胡清华

Owner TIANJIN UNIV

Who we serve

R&D Engineer
R&D Manager
IP Professional

Why Eureka

Industry Leading Data Capabilities
Powerful AI technology
Patent DNA Extraction

Social media

Try Eureka

PatSnap group products

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.

Semi-supervised image semantic segmentation method and device based on self-supervised low-rank network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology