Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A fine-grained image classification method based on discriminative learning

A classification method and fine-grained technology, applied in the field of computer vision, can solve problems such as high computing costs, achieve accurate and effective classification, and reduce the number of effects

Active Publication Date: 2022-07-01
DALIAN UNIV OF TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

A large number of about 2000 candidate patches generated by SS need to be predicted whether to contain discriminative features through a deep CNN classification network, which requires high computational cost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A fine-grained image classification method based on discriminative learning
  • A fine-grained image classification method based on discriminative learning
  • A fine-grained image classification method based on discriminative learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to make the objectives, technical solutions and advantages of the present invention clearer, the specific embodiments of the present invention are further described in detail below.

[0050] Experimental evaluations are performed on two benchmark datasets: Caltech-UCSD Birds-200-2011 and Stanford Cars, which are widely used benchmarks for fine-grained image classification. Birds include 11,788 images in 200 categories. Cars include 16,185 images with a total of 196 classes.

[0051] Implementation details: In our experiments, all images are resized to 448 × 448. We use ResNet-50 as the backbone network and batch normalization as the regularization term. Our optimizer uses momentum SGD with an initial learning rate of 0.001 and multiplied by 0.1 after every 60 epochs. The weight decay rate is set to 1e-4. To reduce patch redundancy, we used Non-Maximum Suppression (NMS), and the NMS threshold was set to 0.25.

[0052] Ablation Experiments: We conduct some a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of computer vision, and provides a fine-grained image classification method based on discriminant learning. We propose a new end-to-end autoregressive localization with discriminative prior network model that learns to explore more accurate discriminative patch sizes and is able to classify images in real-time. Specifically, a multi-task discriminative learning network is designed, including an autoregressive localization sub-network and a discriminative prior sub-network with a bootstrap loss function and a consistency loss function to simultaneously learn the self- Regression coefficients and discriminative prior map. The autoregressive coefficients can reduce the noise information in the discriminative patch, and the discriminative prior map filters thousands of candidate patches into single-digit patches by learning the discriminative probability value. Extensive experiments show that the proposed SDN model achieves the state-of-the-art in both accuracy and efficiency.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and takes improving the accuracy and efficiency of fine-grained image classification as a starting point, and proposes a fine-grained image classification method based on discriminant learning. Background technique [0002] Unlike general image classification, Weakly Supervised Fine-Grained Image Classification (WFGIC) uses only image-level labels to identify objects at more detailed categories and granularities. WFGIC has received extensive attention from academia and industry due to its numerous potential applications in image understanding and computer vision systems. WFGIC is an open problem in the field of computer vision, not only because images belonging to the same subcategory vary widely in size, pose, color, and background, whereas images belonging to different subcategories can be very similar in these aspects, and only using Image-level label extraction is very difficult to d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06V10/764G06V10/766G06V10/82G06K9/62
CPCG06F18/2415
Inventor 王智慧王世杰李豪杰唐涛王宁
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products