RGBD saliency detection method based on feature aggregation

A detection method, RGB image technology, applied in neural learning methods, instruments, biological neural network models, etc., can solve problems such as incompatibility

Pending Publication Date: 2020-11-13
HANGZHOU DIANZI UNIV
View PDF0 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

2) There are inherent differences in the two different modalities of RGB image and depth image, so it is difficult to effectively fuse these two modalities through simple concatenation
Specifically, RGB images mainly reflect appearance...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • RGBD saliency detection method based on feature aggregation
  • RGBD saliency detection method based on feature aggregation
  • RGBD saliency detection method based on feature aggregation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention proposes a RGBD saliency detection method based on feature aggregation, which will be described in detail in conjunction with related steps below.

[0047] Our proposed method is implemented using the PyTorch toolbox and trained on a high-performance server with an NVIDIA GeForce RTX2080Ti GPU and 126GB of memory.

[0048] A RGBD saliency detection method based on feature aggregation, the steps are as follows:

[0049] Step 1, preprocessing the input image;

[0050] The input image includes a depth image and an RGB image. The HHA algorithm is used to encode the depth image from a single channel to a three-channel representation, which respectively characterizes the horizontal parallax, the height from the ground, and the pixel local surface normal and the inferred gravity direction. angle, forming an image pair with RGB image I and depth image D as model input.

[0051] Step 2. Construct a saliency detection network;

[0052] Such as figure 1 A...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an RGBD saliency detection method based on feature aggregation. The RGBD saliency detection method comprises the steps that firstly, an input image is preprocessed; then constructing a significance detection network; the saliency detection network comprises a feature extraction network and a feature aggregation network. The feature extraction network is a pair of asymmetricdouble-flow backbone networks constructed based on ResNet50, and is divided into an RGB image feature extraction branch and a depth image feature extraction branch. The feature aggregation network comprises K nearest neighbors GNNs, a region enhancement module, a hierarchical fusion module and a block Non-local module. And finally, training a saliency detection network, and performing saliency detection through the trained saliency detection network. According to the method, 2D appearance and 3D geometric information are efficiently combined and reasoned, information from two different modes of an RGB image and a depth image is fully fused, and the multi-scale expression ability of the model is further improved through the hierarchical fusion module, so that rough-level features and fine-level features are well fused together.

Description

technical field [0001] The RGBD saliency detection method based on feature aggregation belongs to the field of computer vision, especially using convolutional neural networks to aggregate feature information of different modalities contained in RGB images and depth images. Background technique [0002] With the rapid development of computer vision and the wave of artificial intelligence, deep learning technology has been widely used. Using computer technology to simulate the attention mechanism of the human eye has become a new and challenging research hotspot. Visual salience means that when humans observe a certain area, there is a local area in the visual field that can attract human visual attention, and this local area is called a salient area. Saliency detection is mainly used to highlight salient regions in images or videos. Generally speaking, saliency detection is widely used in image segmentation, object recognition, video coding and other fields, and it is of gr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V10/462G06N3/048G06N3/045G06F18/241
Inventor 颜成钢温洪发周晓飞孙垚棋张继勇张勇东
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products