How to generate a scene graph from an input image
The iterative method enhances scene graph generation by leveraging multimodal models and external knowledge to produce a more detailed and accurate representation of image content, addressing the limitations of existing supervised methods.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Applications
- Current Assignee / Owner
- ROBERT BOSCH GMBH
- Filing Date
- 2025-12-15
- Publication Date
- 2026-06-26
AI Technical Summary
Existing methods for generating scene graphs from images rely heavily on human-annotated data and require user input, limiting their applicability and accuracy, especially in unsupervised scenarios.
An iterative method using a multimodal base model and information extraction to generate scene graphs by extracting triplets from initial text descriptions, supplemented with targeted questions and external knowledge, allowing for a more comprehensive and accurate representation of image content.
Enables the creation of a detailed and robust scene graph through an unsupervised, dynamic approach, capturing broader contextual details like image quality, weather, and lighting, resulting in a richer and more informative output.
Smart Images

Figure 2026105857000001_ABST