Long-tail distribution image data identification method based on dual-channel learning

A technology of image data and recognition method, which is applied in the field of long-tail distribution image data recognition based on dual-channel learning, to achieve the effects of improving feature representation, enhancing compactness, and improving recognition accuracy

Pending Publication Date: 2020-10-02
SOUTH CHINA UNIV OF TECH
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to overcome the shortcomings and deficiencies of the prior art, and proposes an effective, scientific and reasonable long-tail distribution image data recognition method based on dual-channel learning, which combines unbalanced learning and small sample learning for To solve the problem of long-tail distribution image data recognition, the unbalanced learning channel can improve the recognition accuracy of the model for unbalanced datasets, and the small-sample learning channel can improve the feature representation learned by the model and enhance the model's ability to recognize tail-like image data; The constructed dual-channel learning total loss function enables the model to focus on the unbalanced learning channel in the early stage of training, and focus on the small-sample learning channel in the later stage of training, thereby improving the recognition accuracy of the model for long-tail image data as a whole.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Long-tail distribution image data identification method based on dual-channel learning
  • Long-tail distribution image data identification method based on dual-channel learning
  • Long-tail distribution image data identification method based on dual-channel learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention will be further described below in conjunction with specific examples.

[0033] The Places365 dataset is a large image dataset covering 365 scene categories, each category contains no more than 5000 training images, 50 validation images and 900 test images. The Places365 original data set is down-sampled according to the Pareto distribution with a power index parameter of 6, and the training set of the obtained long-tail distribution image data set contains a total of 62500 pictures, of which each category contains a maximum of 4980 pictures and a minimum of 5 pictures Picture, the training set Places-LT of the constructed long-tail distribution image dataset such as figure 1 shown. The validation set of the long-tail distribution image dataset samples 20 images per category, which is used to track and evaluate the performance of the dual-channel learning model. The test set of the long-tail distribution image dataset samples 50 images per catego...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a long-tail distribution image data identification method based on dual-channel learning. The method comprises the following steps: 1) constructing a dual-channel learning model combining unbalanced learning and small sample learning; 2) updating all parameters in the dual-channel learning model by utilizing dual-channel learning total loss and back propagation, and storingoptimal dual-channel learning model parameters; and 3) inputting the image data of the test set to the optimal dual-channel learning model, and obtaining the prediction label of the image. Accordingto the invention, unbalanced learning and small sample learning are combined to solve the problem of long-tail distribution image data identification; the unbalanced learning channel can improve the identification accuracy of the unbalanced data set; the small sample learning channel can improve the feature representation of model learning, and the dual-channel total loss enables the model to focus on the unbalanced learning channel in the early stage of training and focus on the small sample learning channel in the later stage of training, thereby improving the recognition accuracy of the long-tail distribution image data on the whole.

Description

technical field [0001] The invention relates to the technical field of unbalanced classification, small-sample learning, and long-tail distribution image data recognition in machine learning, in particular to a long-tail distribution image data recognition method based on dual-channel learning. Background technique [0002] Long-tail distribution image data recognition usually uses technologies related to unbalanced learning, which are mainly divided into data level and algorithm level. Data-level techniques mainly include downsampling of majority class samples, upsampling of minority class samples or a hybrid sampling method combining the two. However, the resampled data cannot reflect the real data distribution characteristics. For example, the down-sampling method will discard most of the samples, thus losing a lot of valuable information in the data set. The up-sampling method will lead to over-fitting problems and bring Huge computing power consumption. The technology...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06F18/241G06F18/214Y02T10/40
Inventor 陈琼林恩禄朱戈仁
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products