An RGB-D semantic segmentation method based on cross-modal alignment fusion

Using the cross-modal alignment and fusion semantic segmentation network CCFN, with its spatial channel attention and semantic flow correction modules, the method addresses the incomplete learning of feature relationships in RGB-D semantic segmentation and achieves more precise object segmentation results.

CN117218348B · Active · Publication Date: 2026-06-16 · MINJIANG UNIVERSITY

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Patents(China)
Current Assignee / Owner: MINJIANG UNIVERSITY
Filing Date: 2023-09-15
Publication Date: 2026-06-16

Abstract

The application relates to an RGB-D semantic segmentation method based on cross-modal alignment fusion. A spatial channel attention mechanism exploits spatial and channel relationships so that each branch (RGB and depth) can attend to the complementary information of the other, which addresses the noise introduced by combining multiple modalities. A semantic flow correction module is introduced to address the poor alignment of corresponding pixels between the modalities. Finally, the decoder also uses a semantic flow correction module so that deep semantic information is transmitted more effectively to the shallow layers. Experiments verify the reliability and feasibility of the algorithm.
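
The abstract names its two mechanisms but gives no layer-level detail, so the following is only a rough, dependency-free sketch of the general ideas, not the patented CCFN modules. It assumes a simple channel gate (the sigmoid of each channel's global average) for the cross-branch attention, and bilinear resampling along a per-pixel offset field for the flow-based alignment. All function names are hypothetical. In a real network the gates and the offset field would be predicted by learned layers, whereas here the offset field is passed in directly.

```python
import math

def channel_gate(feat):
    """Per-channel gate: sigmoid of each channel's global average.
    feat is a nested list shaped C x H x W. (Assumed gate form.)"""
    gates = []
    for channel in feat:
        values = [v for row in channel for v in row]
        mean = sum(values) / len(values)
        gates.append(1.0 / (1.0 + math.exp(-mean)))
    return gates

def cross_modal_reweight(rgb, depth):
    """Add to each branch the other branch's features, scaled by the
    other branch's channel gates (a hypothetical fusion rule)."""
    g_rgb, g_depth = channel_gate(rgb), channel_gate(depth)

    def mix(own, other, other_gate):
        return [[[o + g * x for o, x in zip(own_row, other_row)]
                 for own_row, other_row in zip(own_ch, other_ch)]
                for own_ch, other_ch, g in zip(own, other, other_gate)]

    return mix(rgb, depth, g_depth), mix(depth, rgb, g_rgb)

def bilinear_sample(channel, y, x):
    """Sample one H x W channel at a fractional (y, x), clamped to the border."""
    h, w = len(channel), len(channel[0])
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = channel[y0][x0] * (1 - dx) + channel[y0][x1] * dx
    bottom = channel[y1][x0] * (1 - dx) + channel[y1][x1] * dx
    return top * (1 - dy) + bottom * dy

def flow_warp(channel, flow):
    """Warp one channel with an offset field where flow[y][x] = (dy, dx)."""
    return [[bilinear_sample(channel, y + flow[y][x][0], x + flow[y][x][1])
             for x in range(len(channel[0]))]
            for y in range(len(channel))]
```

A zero offset field leaves the feature map unchanged, and offsets that point outside the map are clamped to the nearest border pixel. That is one of several reasonable border policies and is an assumption made here.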
