A multimodal target detection method and system suitable for missing modality
A target detection, multi-modal technology, applied in neural learning methods, character and pattern recognition, biological neural network models, etc., can solve the problems of noise in detection results, information loss, modal data loss, etc., to reduce the amount of calculation. and complexity, the effect of improving consistency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0082] Embodiment 1: A multi-modal target detection method suitable for missing modalities. In the following embodiment, H is the height, W is the width, and D is the depth. The two-dimensional RGB image data collected by the camera and the data collected by the lidar are used. Taking the two modal data of 3D space point cloud data as an example, a detailed description will be given, including the following steps:
[0083] S1. Real mode generation stage:
[0084] 1), based on the open source multimodal data set KITTI, to obtain the three-dimensional spatial point cloud data and two-dimensional RGB image data of lidar in the same space and time;
[0085] 2) The two-dimensional RGB image data is reflected in a mathematical expression, that is, a modal feature tensor of size (H, W, 3), which represents the height, width and RGB channels of the image respectively; Resnet network is used As a feature extraction unit for two-dimensional RGB image data, the extraction accuracy is fu...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


