Visual question answering method based on GAT relational reasoning
A visual question answering technique based on relational reasoning, applied in the field of image processing, addresses the problem that spatial reasoning, semantic relations, and scene understanding are ignored, and achieves improved accuracy.
Embodiment Construction
[0049] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0050] The visual question answering method based on GAT relational reasoning of the present invention is implemented according to the following steps:
[0051] Step 1, question embedding: split the question into individual words at punctuation marks and spaces; vectorize the words with the GloVe word vector model; and extract the question vector representation with a bidirectional gated recurrent unit (GRU). This step also aims to reduce the impact of question noise on the answer prediction results. Specifically:
[0052] Step 1.1: First, split the input question into individual words at punctuation marks and spaces. The input question is thereby converted into an array of words, expressed by the following formula:
[0053] $q = [q_1, q_2, \ldots, q_N]$
[0054] where N is the number of words ...
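The word-splitting of Step 1.1 can be illustrated with a minimal Python sketch. It is not the patent's implementation: the function name `tokenize_question`, the lowercasing, and the choice to discard punctuation rather than keep it as tokens are all assumptions made for illustration.

```python
import re
from typing import List


def tokenize_question(question: str) -> List[str]:
    """Split a question into words, treating spaces and punctuation as separators.

    Hypothetical helper for illustration only. Lowercasing is an assumption
    (pretrained word-vector vocabularies are often lowercase); the source
    text does not specify it.
    """
    # Every run of letters or digits is one word; any other character
    # (space, '?', ',', ...) acts as a delimiter and is discarded.
    return re.findall(r"[a-z0-9]+", question.lower())


# q = [q_1, q_2, ..., q_N] for a sample question
q = tokenize_question("What is the dog holding in its mouth?")
N = len(q)  # number of words in the question
print(q, N)
```

Each element of `q` would then be looked up in a word-vector table before being passed to the recurrent encoder described in Step 1.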



