Attention mechanism-based image target prediction method
A technology of target prediction and attention, applied in neural learning methods, computer components, instruments, etc., to achieve the effect of improving efficiency and optimizing the visual backbone
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0060] The technical solutions and beneficial effects of the present invention will be described in detail below in conjunction with the accompanying drawings.
[0061] Such as figure 1 As shown, the present invention provides a kind of image target prediction method based on attention mechanism, comprises the following steps:
[0062] 1. Model implementation process:
[0063] 1.1 Input of the model:
[0064] Such as figure 2 As shown, the input of the model is an RGB image with a size of 320×320×3, and a description language for an object in the picture, and the longest text input of the model is set to 15.
[0065] 1.2 Visual Feature Encoder:
[0066] For the input RGB image, we use the VOC target detection dataset (see Mark Everingham, Luc Van Gool, Christopher K IWilliams, John Winn, and Andrew Zisserman. The pascalvisual object classes (voc) challenge. In IJCV, 2010.) The pre-trained neural network DeepLab-ResNet101 (see Liangchieh Chen, George Papandreou, Iasonas K...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


