Multimodal salient object detection method based on an encoder-decoder structure
A salient object detection technique based on an encoder-decoder structure, applied in the field of computer vision. It addresses problems such as repeated development, and has the effect of improving recognition accuracy and stability, reducing the cost of repeated development, and promoting industrial application.
Embodiment Construction
[0017] The present invention will be described in detail below in combination with specific embodiments.
[0018] The present invention proposes a multimodal salient object detection method based on an encoder-decoder structure, which is implemented according to the following steps.
[0019] Step 1. Select an appropriate dataset, preprocess it, and divide it into a training set and a test set.
[0020] The color images use the RGB color space, and the depth images encode depth information as grayscale values from 0 to 255. The meaning of the depth-image pixel values in the dataset should be consistent with that of the depth-sensing device. The dataset can be drawn from five public datasets: NJU2K, LFSD, NLPR, STERE, and DES. In this embodiment, 1400 color images and their corresponding depth images are randomly selected from the NJU2K dataset, and 650 color images are randomly selected from the NLPR dataset. A color image and the corresponding depth i...
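The random selection and the 0-255 depth encoding described in Step 1 might be sketched as follows. This is a minimal illustration and not the procedure of the invention: the function names `split_pairs` and `depth_to_uint8`, the sample identifiers, the placeholder dataset sizes, the fixed seed, and the min-max rescaling of raw depth values are all assumptions introduced here.

```python
import random


def split_pairs(pair_ids, n_selected, seed=0):
    """Randomly pick n_selected RGB-D pair ids; return (selected, remaining)."""
    if n_selected > len(pair_ids):
        raise ValueError("n_selected exceeds the number of available pairs")
    rng = random.Random(seed)  # fixed seed (assumption) so the split is reproducible
    chosen = set(rng.sample(pair_ids, n_selected))
    selected = [p for p in pair_ids if p in chosen]
    remaining = [p for p in pair_ids if p not in chosen]
    return selected, remaining


def depth_to_uint8(depth_rows):
    """Min-max rescale a 2-D list of raw depth values to integers in 0..255.

    Min-max scaling is an assumption; the source only states that depth is
    stored as 0-255 grayscale values.
    """
    flat = [v for row in depth_rows for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:  # constant depth map: avoid division by zero
        return [[0 for _ in row] for row in depth_rows]
    scale = 255.0 / (hi - lo)
    return [[int(round((v - lo) * scale)) for v in row] for row in depth_rows]


# Hypothetical id lists with placeholder sizes; real ids would be read
# from the dataset directories.
nju2k_ids = [f"nju2k_{i:04d}" for i in range(1985)]
nlpr_ids = [f"nlpr_{i:04d}" for i in range(1000)]

nju2k_selected, nju2k_rest = split_pairs(nju2k_ids, 1400)
nlpr_selected, nlpr_rest = split_pairs(nlpr_ids, 650)
```

Each color image and its depth image share one identifier, so selecting identifiers keeps every RGB-D pair together. How the selected and remaining samples are assigned to the training and test sets is left to the caller.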


