Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Visual semantic relation detection method based on RGB-D image

A semantic relationship and visual technology, applied in the field of visual semantic relationship detection, can solve problems such as low accuracy and insufficient generalization ability, and achieve the effect of enriching dimensions, rich detection basis, and improving accuracy and generalization ability.

Pending Publication Date: 2020-07-03
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the complex visual scene in the image, the traditional semantic relationship detection method has problems such as low accuracy and insufficient generalization ability when extracting the visual semantic relationship.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Visual semantic relation detection method based on RGB-D image
  • Visual semantic relation detection method based on RGB-D image
  • Visual semantic relation detection method based on RGB-D image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention provides a visual semantic relationship detection method based on RGB-D images, the basic idea is: while extracting the RGB feature vector of the image, estimate the depth map of the image and extract the corresponding depth feature vector, and combine the RGB feature vector with The depth feature vectors are fused; the fused feature vectors are input into a visual semantic relationship classifier, and the visual semantic relationship classifier is designed to simultaneously process RGB feature vectors and depth feature vectors and weaken the depth feature vector part to obtain visual semantic relationship detection results.

[0042] It can be seen that the present invention increases the estimated depth feature vector in the detection process, and the detection basis is more abundant. At the same time, considering the inaccuracy of the depth feature vector, it reduces the negative impact of its inaccurate components on the detection model while usin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a visual semantic relation detection method based on an RGB-D image, and the method comprises the steps: firstly extracting an RGB feature vector of the image, estimating a depth map of the image, extracting a corresponding depth feature vector, and enabling the RGB feature vector and the depth feature vector to be fused; and inputting the fused feature vectors into a visual semantic relationship classifier, the visual semantic relationship classifier being designed to process the RGB feature vectors and the depth feature vectors at the same time and weaken the depth feature vector part to obtain a visual semantic relationship detection result. The detection precision and the generalization ability of the visual semantic relationship in a complex visual scene can beimproved.

Description

technical field [0001] The invention relates to deep learning and computer vision, in particular to a visual semantic relationship detection method in visual scene understanding. Background technique [0002] In the face of a large amount of image information, if the computer is to truly understand the content of the visual scene, it is not only necessary to detect each object in the image, but also to extract the interactive relationship between each object. Due to the complex visual scene in the image, traditional semantic relationship detection methods have problems such as low accuracy and insufficient generalization ability when extracting the visual semantic relationship. [0003] RGB-D images include RGB information and depth (Depth) information, which contains more information than traditional RGB images. At the same time, the fusion of RGB information and depth information in semantic relationship detection is more in line with the law of human cognition . The RGB...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/32G06K9/62
CPCG06V10/25G06F18/241G06F18/253
Inventor 甘明刚刘晓舟陈杰窦丽华
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products