Scene graph generation method based on depth relation self-attention network

A technology of attention and scene graph, applied in neural learning methods, biological neural network models, natural language data processing, etc., can solve problems such as networks without modeling context information

Active Publication Date: 2021-03-09
HANGZHOU DIANZI UNIV
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] (2) How to model the semantic information of the scene graph at a deeper level to better obtain rich context information: most of the methods are to separately model the target context information and the relationship context information, and there is no need to model the context information The network performs deep stacking processing. The problem that may arise in this method is that it cannot effectively model the target context information and the relationship context information, because the target context information and the relationship context information affect each other, so what we use here is Simultaneous Model target context information and relationship context information, and obtain richer context information by stacking network depth

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene graph generation method based on depth relation self-attention network
  • Scene graph generation method based on depth relation self-attention network
  • Scene graph generation method based on depth relation self-attention network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0096] The detailed parameters of the present invention will be further specifically described below.

[0097] Such as figure 1 , 2 As shown, the present invention provides a scene graph generation method based on deep relational self-attention network.

[0098] The specific implementation of step (1) is as follows:

[0099] Eliminate some low-quality image data, and use the most common 150 target categories and 50 relationship categories; there is also a division of the data set, using 70% of the data in the data set for training, and the remaining 30% for testing.

[0100] Step (2) uses the trained target detection network to extract features from the image, as follows:

[0101] Each candidate box corresponds to the feature p of the image area f ,in , select 64 candidate boxes for each picture, and stitch all the candidate boxes in a picture into an overall feature

[0102] Step (3) constructs the spatial feature according to the spatial position coordinates of the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a scene graph generation method based on a depth relation self-attention network. The method comprises the following steps: 1, data preprocessing and data set division; 2, feature extraction of an image by using a pre-trained target detection network; 3, construction of spatial features of a target; 4, construction of language features of the target; the method is used formodeling the RSAN network of the target context and the relation context at the same time, achieves a significant improvement effect on a task generated by a scene graph, and surpasses most mainstreammethods on the task. In addition, the RSAN network also has very important application value and huge potential in other cross-modal related fields such as image content questioning and answering andvisual relationship detection.

Description

technical field [0001] The present invention proposes a method for generating scene graphs (Scene Graph Generation) based on deep relational self-attention networks (Relational Self-Attention Networks). Background technique [0002] Scene graph generation is an emerging task in the field of multimedia, which aims to model contextual information of objects and relationships among objects, and generate corresponding scene graphs for images. Specifically, an image is input, and a scene graph abstracted from the image is generated after passing through the model. The scene graph contains node and edge information, and the nodes and edges represent the relationship between the target and the target respectively. For example, the content of the image is a person wearing a hat holding a barrel and feeding a horse, which may include "a person wearing glasses", "a person feeding a horse", "a person carrying a barrel", "a horse from a barrel" Semantic information such as "eating food...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/583G06F16/587G06K9/72G06N3/04G06N3/08G06F40/284
CPCG06F16/5846G06F16/587G06N3/084G06F40/284G06V30/268G06V30/274G06V2201/07G06N3/045
Inventor 俞俊李娉余宙
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products