Structured SVM-based unbalanced evaluation criterion direct optimization algorithm

An optimization algorithm and structured technology, applied in computing, computer parts, instruments, etc., can solve problems such as not reflecting the actual distribution, inclusion of noise data, misclassification of minority samples, etc.

Inactive Publication Date: 2016-10-12
ANHUI UNIVERSITY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Facing the learning problem of unbalanced data sets, the difficulty of research mainly comes from the characteristics of the unbalanced data itself: the minority class samples in the unbalanced data set are insufficient, and the distribution of samples cannot well reflect the actual distribution of the entire class; the majority class Noisy data is usually mixed in, so that the two types of samples tend to overlap to varying degrees
When the traditional classification methods in the field of machine learning are directly applied to the classification problem of unbalanced data, it is easy to misclassify the samples of the minority class, which makes the classification accuracy of the minority class very low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Structured SVM-based unbalanced evaluation criterion direct optimization algorithm
  • Structured SVM-based unbalanced evaluation criterion direct optimization algorithm
  • Structured SVM-based unbalanced evaluation criterion direct optimization algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In this example, if figure 1 As shown, a direct optimization algorithm based on the unbalanced evaluation criterion of structured SVM is carried out as follows:

[0056] Step 1. Suppose there is an unbalanced data set, denoted as x i Indicates the i-th training sample, x i ∈R d ; d Represents the d-dimensional space of real numbers; y i Indicates the class label corresponding to the i-th training sample, and y i ∈{+1,-1}; when y i =+1, means the i-th training sample x i is a positive sample, when y i =-1, means the i-th training sample x i is a negative sample; 1≤i≤n;

[0057] All training samples in the unbalanced data set D are sorted positively and then negatively, so that all positive samples are in the front part of the unbalanced data set D, and all negative samples are in the back part of the unbalanced data set D, thus forming a sorted Unbalanced data set D'={(x,y)}; x represents the training sample queue, y represents the class label queue, and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a structured SVM-based unbalanced evaluation criterion direct optimization algorithm. The algorithm is characterized by being implemented by the following steps of 1, selecting a data set and performing positive and negative sorting on all training samples in the data set; 2, defining a structured SVM framework-based target function; 3, defining an unbalanced evaluation criterion-oriented loss function delta GTP / PR(y,y) and an associated function psi (x,y) according to a loss function delta (y,y); 4, initializing a trade-off parameter C and an error allowable value epsilon; and 5, performing iterative solving on the target function by utilizing a cutting plane algorithm to obtain an institutionalized SVM framework-based target function. According to the algorithm, the classification precision of minority classes is improved; and the algorithm can be applied to classification of unbalanced data.

Description

technical field [0001] The invention relates to the technical field of statistical learning classification, more specifically, a direct optimization algorithm based on structured SVM imbalance evaluation criteria. Background technique [0002] With the advent of the data age and the rapid development of information technology, people will continue to generate massive amounts of data information in their daily life, production and business operations, and these data contain data information that is of great significance to people, but in There is often only a small part of information in such massive data that we really need to analyze and process. [0003] There are also many important applications in real life. For example, mobile phones and computers used in daily life provide telecom operators with a large amount of life data every day. Operators can obtain the preferences of different users by analyzing the effective information in these massive data, and launch suitabl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/2411G06F18/214
Inventor 程凡张磊杨康刘政怡张兴义
Owner ANHUI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products