Model training method, address localization methods, electronic device, storage medium and computer program product

WO2026130030A1PCT designated stage Publication Date: 2026-06-25CLOUD INTELLIGENCE ASSETS HOLDING (SINGAPORE) PTE LTD +1

Patent Information

Authority / Receiving Office
WO · WO
Patent Type
Applications
Current Assignee / Owner
CLOUD INTELLIGENCE ASSETS HOLDING (SINGAPORE) PTE LTD
Filing Date
2025-11-21
Publication Date
2026-06-25

Smart Images

  • Figure CN2025136871_25062026_PF_FP_ABST
    Figure CN2025136871_25062026_PF_FP_ABST
Patent Text Reader

Abstract

The embodiments of the present application relate to the technical fields of large models and image address localization, and provide a model training method, address localization methods, an electronic device, a storage medium and a computer program product. The model training method comprises: acquiring a training data set, the training data set comprising training scene images and a visual question answering data set of address information corresponding to the training scene images; using the training data set to perform cross-view alignment training on an initial multi-modal address localization model, so as to generate an intermediate multi-modal address localization model; and using the training data set to perform address localization training on the intermediate multi-modal address localization model, so as to generate a target multi-modal address localization model, the target multi-modal address localization model being used for performing address localization analysis on a target scene image to be processed and an address question, so as to obtain an address answer. The described method solves the problems of coarse localization granularity, poor localization accuracy, and poor flexibility of question-answering interaction of image address localization solutions in the prior art.
Need to check novelty before this filing date? Find Prior Art