A Weakly Supervised Object Classification and Localization Method Based on Divergence Learning
A technology of target classification and localization method, applied in the field of weakly supervised target classification and localization based on divergence learning, it can solve problems such as difficulty in optimizing object localization, and achieve the effect of optimizing image classification loss, optimizing divergence loss, and high performance.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0087] 1. Database and sample classification
[0088] The divergence network is evaluated on the commonly used CUB-200-2011 and ILSVRC2016 data sets. CUB-200-2011 contains 11,788 images of 200 species of birds, of which 5,994 are used for training and 5,794 are used for testing. According to taxonomy, we divide 200 species of birds into three levels, including 122 genera, 37 families and 11 orders. For ILSVRC2016, we used 1.2 million images and 1,000 classes for training, and used 5,000 images for testing in the validation set. We apply the ready-made category hierarchy that appears with the ILSVRC 2016 data set. For example, "dog", "cat" and "rabbit" are grouped into the parent category "animal", and "chairs" and "tables" are grouped into the parent category "furniture".
[0089] Construction of classification and positioning network: Integrate the divergent activation module with VGGnet and GoogLeNet, including VGGnet and GoogLeNet: delete the VGG-16 network and the pooling la...
experiment example 1
[0130] The effectiveness of the hierarchical divergence activation module and the differential divergence activation module (differential divergence) in the network and the proposed regularization factor λ are respectively verified.
[0131] 1) The influence of hierarchical divergence activation module and differential divergence activation module
[0132] Table 5: The influence of the level of divergence activation module and the difference divergence activation module
[0133]
[0134]
[0135] As shown in Table 5, compared with the baseline CAM method, the introduction of the hierarchical divergence activation module reduces the top-1 / top-5 positioning error rate by 5.14% / 4.36%. in Image 6 In, the example of activation diagram shows the impact of hierarchical divergence activation modules. Only from the supervision of sub-category tags, CAM tends to activate object parts, such as bird heads. Through the introduction of hierarchical supervision of image categories, the activatio...
experiment example 2
[0142] Experimental example 2 The influence of the number of feature output layers
[0143] The divergence learning network model based on the VGGnet network was tested on the CUB-200-2011 test set to verify the influence of the number of feature output layers. The results are shown in Table 6 below.
[0144] Table 6 The influence of the number of feature output layers on positioning
[0145] Feature output layers Positioning error rate 155.85 252.8 350.71 451.34
[0146] It can be seen in Table 6 that as the number of feature output layers increases, the overall positioning error rate is decreasing, which shows that the use of hierarchical divergence activation modules can effectively improve the positioning effect, and when the number of feature output layers increases from three to four When layering, the positioning result drops. This is because the shallow features are not enough to distinguish the object category, which affects the positioning result.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


