RGBD saliency detection method based on feature aggregation

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A detection method, RGB image technology, applied in neural learning methods, instruments, biological neural network models, etc., can solve problems such as incompatibility

Pending Publication Date: 2020-11-13

HANGZHOU DIANZI UNIV

View PDF0 Cites 18 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

2) There are inherent differences in the two different modalities of RGB image and depth image, so it is difficult to effectively fuse these two modalities through simple concatenation

Specifically, RGB images mainly reflect appearance information such as color, texture, and brightness, while depth images express more spatial and geometric information in the scene. The two are quite different. If only a simple fusion strategy is adopted, it may lead to Incompatibilities

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0046] The present invention proposes a RGBD saliency detection method based on feature aggregation, which will be described in detail in conjunction with related steps below.

[0047] Our proposed method is implemented using the PyTorch toolbox and trained on a high-performance server with an NVIDIA GeForce RTX2080Ti GPU and 126GB of memory.

[0048] A RGBD saliency detection method based on feature aggregation, the steps are as follows:

[0049] Step 1, preprocessing the input image;

[0050] The input image includes a depth image and an RGB image. The HHA algorithm is used to encode the depth image from a single channel to a three-channel representation, which respectively characterizes the horizontal parallax, the height from the ground, and the pixel local surface normal and the inferred gravity direction. angle, forming an image pair with RGB image I and depth image D as model input.

[0051] Step 2. Construct a saliency detection network;

[0052] Such as figure 1 A...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an RGBD saliency detection method based on feature aggregation. The RGBD saliency detection method comprises the steps that firstly, an input image is preprocessed; then constructing a significance detection network; the saliency detection network comprises a feature extraction network and a feature aggregation network. The feature extraction network is a pair of asymmetricdouble-flow backbone networks constructed based on ResNet50, and is divided into an RGB image feature extraction branch and a depth image feature extraction branch. The feature aggregation network comprises K nearest neighbors GNNs, a region enhancement module, a hierarchical fusion module and a block Non-local module. And finally, training a saliency detection network, and performing saliency detection through the trained saliency detection network. According to the method, 2D appearance and 3D geometric information are efficiently combined and reasoned, information from two different modes of an RGB image and a depth image is fully fused, and the multi-scale expression ability of the model is further improved through the hierarchical fusion module, so that rough-level features and fine-level features are well fused together.

Description

technical field [0001] The RGBD saliency detection method based on feature aggregation belongs to the field of computer vision, especially using convolutional neural networks to aggregate feature information of different modalities contained in RGB images and depth images. Background technique [0002] With the rapid development of computer vision and the wave of artificial intelligence, deep learning technology has been widely used. Using computer technology to simulate the attention mechanism of the human eye has become a new and challenging research hotspot. Visual salience means that when humans observe a certain area, there is a local area in the visual field that can attract human visual attention, and this local area is called a salient area. Saliency detection is mainly used to highlight salient regions in images or videos. Generally speaking, saliency detection is widely used in image segmentation, object recognition, video coding and other fields, and it is of gr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/46G06K9/62G06N3/04G06N3/08

CPCG06N3/08G06V10/462G06N3/048G06N3/045G06F18/241

Inventor 颜成钢温洪发周晓飞孙垚棋张继勇张勇东

Owner HANGZHOU DIANZI UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

RGBD saliency detection method based on feature aggregation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology