Dual-sampling integration classification model based on Fisher kernel

A double-sampling and classification model technology, applied in the field of pattern recognition, to achieve accurate and improved classification effects

Inactive Publication Date: 2018-11-09
EAST CHINA UNIV OF SCI & TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Usually in imbalanced problems, the difference in sample size leads to the fact that minority samples are rarely sufficiently minimized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dual-sampling integration classification model based on Fisher kernel
  • Dual-sampling integration classification model based on Fisher kernel
  • Dual-sampling integration classification model based on Fisher kernel

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0010] Below in conjunction with accompanying drawing and example the present invention will be further introduced: the system designed by the present invention is divided into four modules altogether.

[0011] Part 1: Data Preprocessing

[0012] The number of sample subsets generated by sampling corresponds to the number of sub-classifiers, and the sub-classifiers are trained on the corresponding subsets. Define the negative class as the majority class sample, and the positive class as the minority class sample. All training samples are synthesized into a training matrix X according to the rule that each column is a sample for storage.

[0013] Part II: Fisher Kernel Mapping

[0014] In this part, the samples in the Fisher kernel space are used to form a new sample set. Therefore, in order to construct the Fisher kernel map, it is necessary to use the EM algorithm to obtain the component parameters of the Gaussian mixture model (GMM) for the data set. Now suppose the origi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a dual-sampling integration classification model based on Fisher kernel. Data is mapped to a high-dimensional Fisher space to obtain sample expression with better distinguishing features. Samples are subjected to multiple feature-level sampling in a Fisher space to obtain multiple view angles to increase information required in the classification so as to improve thestability of a base classifier. An integration method is employed to perform integration of inside of the view angles and outside of the view angles. Integration of different view angle expressions is employed to maintain the diversity of the whole sample. The whole system is designed, and a frame integration method is provided for an unbalance classification problem; the classification information is increased for the unbalance data through the system so as to provide a more accurate classification result; combined training models with different structures can be used for concrete problems according to the concrete problems; and different sample expression matrixes with various forms and vectors are generated according to the sample information quantity to enrich the training data so asto improve the final classification effect.

Description

technical field [0001] The invention relates to the technical field of pattern recognition, in particular to a Fisher kernel-based double sampling integrated classification model for unbalanced data classification. Background technique [0002] There are two main categories of methods in pattern recognition, generative models and discriminative models. The generation model focuses on the generation process of the probability density function, which can provide more information for the study of the data. Whereas discriminative models focus on direct classification and directly learn classification boundaries. It makes decisions based on the prior knowledge of the training set, so it is less computationally expensive than the generative model, and its performance is better than the generative model. [0003] Existing methods to solve the imbalance problem mainly include sampling method, method of introducing cost function, and integration method. The sampling method rebalan...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2411
Inventor 王喆李冬冬陈钊志杜文莉张静
Owner EAST CHINA UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products