Illusion processing method and device of multi-modal large model, equipment and medium
By constructing a hallucination assessment framework and a semantic segmentation-based differentiated reweighting strategy, the shortcomings of multimodal large models in hallucination assessment and relief are addressed, achieving fine-grained quantitative evaluation and accurate hallucination relief, and improving the accuracy and reliability of the model's output.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- HANGZHOU INST FOR ADVANCED STUDY UCAS
- Filing Date
- 2026-05-25
- Publication Date
- 2026-06-19
AI Technical Summary
Existing multimodal large models struggle to cover fine-grained hallucination types in hallucination evaluation, lack scene adaptability, and lack fine-grained distinction in inference intervention, resulting in inaccurate outputs from the models in complex visual semantic tasks.
A hallucination assessment framework is constructed. By comparing assessment samples with real labels, key attention heads are located. Then, semantic segmentation and differentiated reweighting strategies are used to adjust attention weights to achieve hallucination relief in a multimodal large model.
It enables fine-grained quantitative evaluation of multimodal large models across different evaluation dimensions, accurately locates the causes of hallucinations, effectively alleviates hallucination problems in the reasoning process, enhances the model's scenario adaptability in hallucination relief, and improves the accuracy and reliability of the output results.
Smart Images

Figure CN122242584A_ABST