Supercharge Your Innovation With Domain-Expert AI Agents!

Scene graph generation method and device

A technology of scene graph and image data, which is applied in the field of computer vision, can solve problems such as poor learning effect, visual relationship detection effect needs to be improved, and noise labels are not considered, so as to achieve the effect of reducing label noise and excellent visual relationship detection performance

Active Publication Date: 2021-10-26
航天宏康智能科技(北京)有限公司
View PDF12 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in practical engineering, the cost of collecting and annotating large-scale datasets is high, and manual annotation is error-prone, and even high-quality datasets may contain noisy labels
At the same time, the visual visual relationship in the scene graph has a long-tail effect, that is, most existing scene graph models are good at fitting predicates with high frequency in the dataset, but poor in learning visual relationships with fewer labeled instances.
Specifically, the existing scene graph generation methods have stable performance for frequently occurring predicate categories, but are not ideal for noisy labels in the dataset and difficult-to-learn relationship categories.
Although the existing research enhances the visual relationship detection ability of the model by improving the data set, such as using the generated missing labels to train the scene graph and alleviating the semantic ambiguity in visual relationship detection through probabilistic modeling, none of them considers the large-scale process. The problem of noisy labels is common in human-annotated datasets, and the visual relationship detection effect for datasets containing noisy labels needs to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene graph generation method and device
  • Scene graph generation method and device
  • Scene graph generation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The following detailed description is provided to assist the reader in gaining an overall understanding of the methods, devices and / or systems described herein. However, various changes, modifications and equivalents of the methods, apparatus and / or systems described herein will be apparent after understanding the disclosure of the present application. For example, the order of operations described herein are examples only, and are not limited to those orders set forth herein, but, except for operations that must occur in a particular order, may occur as will become apparent after understanding the disclosure of this application. That's changed. Also, descriptions of features that are known in the art may be omitted for increased clarity and conciseness.

[0053] The features described herein may be implemented in different forms and should not be construed as limited to the examples described herein. Rather, the examples described herein have been provided to illustr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a scene graph generation method and device. The generation method comprises the following steps of acquiring the image data and the text data; obtaining the feature vectors of the image data through a Faster R-CNN target detector, and fusing the feature vectors to obtain the visual feature vectors; obtaining the word vectors of the text data based on a pre-trained fastText model, and fusing the word vectors to obtain the semantic feature vectors; and matching the visual feature vectors and the semantic feature vectors to obtain a visual relationship prediction value, and performing symmetric learning on the visual relationship prediction value by using a cross entropy function and a reverse cross entropy function to obtain a final visual relationship prediction value, thereby generating a scene graph. According to the generation method, the problem of label noise of a data set subjected to manual annotation can be effectively reduced.

Description

technical field [0001] The present disclosure generally relates to the field of computer vision, and more specifically, relates to a method and device for generating a scene graph based on symmetric learning. Background technique [0002] The rapid development of the field of computer vision has made many breakthroughs in visual tasks such as image classification, semantic segmentation and visual relationship detection in a short period of time. Network (Region-based CNN, R-CNN) and fully convolutional network (Fully Convolutional Network, FCN), driven. [0003] On this basis, the research on image understanding has gradually developed from low-level feature extraction to high-level semantic learning. The next step is to infer the semantic relationship between multiple objects, thereby promoting the development of multimodal tasks, such as visual question answering, image description and visual commonsense reasoning. Among them, the emergence of Scene Graph Generation (SGG...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06K9/00G06F40/30G06F40/284G06N3/04G06N3/08
CPCG06F40/30G06F40/284G06N3/084G06N3/045G06F18/22G06F18/253G06F18/214
Inventor 经小川刘萱杜婉茹王潇茵孙鹏程
Owner 航天宏康智能科技(北京)有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More