An RGB-D semantic segmentation method based on cross-modal alignment fusion

By using the cross-modal alignment and fusion semantic segmentation network CCFN, and leveraging spatial channel attention and semantic flow correction modules, the problem of incomplete learning of feature relationships in RGB-D semantic segmentation is solved, achieving higher-precision object segmentation results.

CN117218348BActive Publication Date: 2026-06-16MINJIANG UNIVERSITY

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
MINJIANG UNIVERSITY
Filing Date
2023-09-15
Publication Date
2026-06-16

Smart Images

  • Figure CN117218348B_ABST
    Figure CN117218348B_ABST
Patent Text Reader

Abstract

The application relates to an RGB-D semantic segmentation method based on cross-modal alignment fusion. A spatial channel attention mechanism is used to fully utilize the spatial and channel relationship, so that both branches can pay attention to the complementary information of the other branch to solve the noise problem caused by the introduction of multi-modal, and a semantic flow correction module is introduced to effectively solve the problem that corresponding pixels between different modalities cannot be well aligned. Finally, a decoder with a semantic flow correction module is also used in the decoder part to make the deep semantic information more effectively transmitted to the shallow layer. Experiments verify the reliability and feasibility of the algorithm.
Need to check novelty before this filing date? Find Prior Art