Salient object detection method based on deep network layering and multi-task training

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An object detection, deep network technology, applied in the field of image processing and computer vision, can solve the problem of insufficient details of the edge of the object

Active Publication Date: 2021-01-26

SICHUAN UNIV

View PDF7 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Although there are many existing detection models and algorithms for salient objects, it is still extremely challenging to detect salient objects from complex and unrestricted scenes. How to more accurately locate salient objects and segment the exact boundaries of the object is the key to be solved urgently one of the problems

At present, the emerging saliency detection method based on deep learning (for example, Li et al. proposed "DeepSaliency: Multi-Task Deep NeuralNetwork Model for Salient Object Detection" in 2016), although using multi-task training, in terms of locating salient objects It has great advantages, but it still has a lot of shortcomings in describing the details of the edge of the object

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0032] In the first step, one or more tasks associated with the salient object detection task prepare training pictures required for multi-task training. Among them, for the saliency detection task, the training picture includes the original image and its corresponding saliency map, and for other tasks, the training picture includes the original image and its corresponding real result. Other tasks refer to one or more tasks that are intrinsically related to the saliency detection task and can share features, such as semantic segmentation, human gaze point prediction, etc. For semantic segmentation tasks, the training image includes the original image and the class label map to which the region in the image belongs.

[0033] The second step is to design a deep neural network architecture and loss function with hierarchical features. Specific steps include:

[0034] S2-1: Design the network structure, which includes a main forward network and multiple side paths connected to s...

Embodiment 2

[0040]The difference between embodiment 2 and embodiment 1 is that the task associated with the salient object detection task is the human gaze point prediction task, and the multi-task training pictures adopted in the joint training include the original image and the human eye gaze point prediction task picture, and the hierarchical The loss function corresponding to the human gaze point prediction task in the feature deep neural network architecture is the cross-entropy loss function, and the corresponding hierarchical feature deep neural network architecture is as follows: Figure 4 shown.

[0041] Other methods and steps are the same as in Embodiment 1, and will not be repeated here.

[0042] Based on the deep neural network architecture with hierarchical features in Embodiment 1, other tasks are expanded in the manner of this embodiment, and the formed network architecture should fall within the scope of protection of the present invention.

Embodiment 3

[0044] The difference between embodiment 2 and embodiment 1 is that the task associated with the salient object detection task uses multiple tasks, the multiple tasks are human gaze point prediction tasks and semantic segmentation tasks, and the corresponding deep neural network architecture of hierarchical features like Figure 5 shown.

[0045] The first step is to prepare the training pictures required for multi-task training. Among them, for the saliency detection task, the training picture includes the original image and its corresponding saliency map; for the semantic segmentation task, the training picture includes the original image and the class label map to which the area in the picture belongs; for the human gaze point prediction task, the training The picture contains the original picture and the prediction picture of human gaze point.

[0046] The second step is to design a deep neural network architecture and loss function with hierarchical features. Specific ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a salient object detection method based on deep network layering and multi-task training in the technical field of image processing and computer vision. The steps include: 1. Determine one or more tasks associated with the salient object detection task ; 2, select the multi-task training picture; 3, carry out joint training to the deep neural network of hierarchical feature, obtain the deep neural network of optimized hierarchical feature, the deep neural network of described hierarchical feature adopts the depth of hierarchical feature Neural network architecture; 4. Input images to the deep neural network with optimized hierarchical features to obtain salient object detection results. The invention utilizes multi-task joint training and integrates the layered features of the deep neural network to realize more accurate positioning of salient object detection and more accurate and detailed object edge description.

Description

technical field [0001] The invention relates to the technical fields of image processing and computer vision, in particular to a salient object detection method based on deep network layering and multi-task training. Background technique [0002] Salient object detection intends to automatically detect eye-catching objects in images or scenes, and the detected areas or objects can be input into subsequent processing modules as regions of interest, in target detection and recognition, image compression, image retrieval, content-based image Editing and other fields have a wide range of applications. Although there are many existing detection models and algorithms for salient objects, it is still extremely challenging to detect salient objects from complex and unrestricted scenes. How to more accurately locate salient objects and segment the exact boundaries of the object is the key to be solved urgently one of the problems. At present, the emerging saliency detection method ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06K9/00G06N3/08G06N3/04

Inventor 傅可人赵启军

Owner SICHUAN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Salient object detection method based on deep network layering and multi-task training

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology