Multi-modal saliency object detection method based on coding and decoding structure

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An object detection, encoding and decoding technology, applied in the field of computer vision, can solve problems such as re-development, and achieve the effect of enhancing recognition accuracy and stability, reducing repetitive development costs, and promoting the development of industrial applications.

Pending Publication Date: 2020-11-10

HANGZHOU DIANZI UNIV

View PDF2 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

At present, the main challenges are divided into two points. On the one hand, new algorithms need to be re-developed to jointly process color images and depth images; on the other hand, simple and effective strategies are needed to integrate depth information into existing algorithms.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0017] The present invention will be described in detail below in combination with specific embodiments.

[0018] The present invention proposes a multimodal salient object detection method based on codec structure, which is implemented according to the following steps.

[0019] Step 1. Select an appropriate data set and perform preprocessing to divide the training set and test set.

[0020] The color image adopts the RGB color space format, and the depth image adopts the format of 0-255 grayscale value to express the depth information. The meaning of the pixel value of the depth image in the data set should be consistent with that of the depth perception device. The data set can be selected from five public data sets: NJU2K, LFSD, NLPR, STERE, and DES. In this embodiment, 1400 color images and corresponding depth images are randomly selected from the NJU2K data set, and 650 color images are randomly selected from the NLPR data set. A color image and the corresponding depth i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a multi-modal saliency object detection method based on a coding and decoding structure. On the basis of an existing color image algorithm model, a depth image is used as priorinformation to be supplemented into an algorithm. The method provided by the invention can be divided into two parts: one part takes a color image as input and realizes saliency detection based on a deep learning technology of a coding and decoding structure; and the other part is used for processing the depth image, learning the depth features by using a lightweight neural network, and supplementing the feature information to the first part, thereby improving the recognition precision of the overall model. According to the method, the depth image can be simply and efficiently utilized, features are learned from the depth image, the recognition precision and stability of the algorithm are enhanced, and a quick and low-cost updating means can be provided for an existing deployed saliency detection algorithm.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and is especially aimed at the task of image salient object detection, and in particular relates to a multimodal salient object detection algorithm based on a codec structure and combining color images and depth images. Background technique [0002] Computer vision technology is an important branch of the field of artificial intelligence. With the rapid development of neural networks and deep learning, many problems in the field of computer vision, such as image classification, target detection, and image segmentation tasks, have made great progress. The accuracy of the algorithm , The real-time performance has been greatly enhanced. At present, computer vision technology has been widely used in actual production and life, especially in the fields of security monitoring, payment, and intelligent manufacturing, to promote social production cost reduction and efficiency improvement, and imp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06K9/46G06K9/62G06N3/04G06N3/08

CPCG06N3/08G06V10/462G06N3/045G06F18/241

Inventor 周晓飞颜成钢潘亮贺熠凡孙垚棋张继勇张勇东

Owner HANGZHOU DIANZI UNIV

Multi-modal saliency object detection method based on coding and decoding structure

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology