Method for classifying N central points based on gene expression programming

A classification method and expression technology, applied in the field of machine learning, can solve the problems of sensitive balance and large amount of calculation of data to be classified, and achieve the effect of improving efficiency, reducing time complexity and improving search efficiency

Inactive Publication Date: 2015-06-10
ZHEJIANG UNIV OF TECH
View PDF0 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The k-nearest neighbor method is a widely used distance-based classification method, but the accuracy of the method is sensitive to the value of k and the balance of the training set data, and the amount of calculation for each test to be classified is relatively large

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for classifying N central points based on gene expression programming
  • Method for classifying N central points based on gene expression programming
  • Method for classifying N central points based on gene expression programming

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The N medoid classification method constitutes a classifier by finding a set of medoids that can accurately represent classes in a multidimensional space. This process involves search and evaluation: searching for class medoids in a multidimensional space and evaluating whether these class medoids accurately represent the class. Gene expression programming has the characteristics of parallel search and strong global optimization ability. The invention applies gene expression programming to the N-center point classification method, and proposes an N-center point classification method based on gene expression programming.

[0043] An N-centroid classification method based on gene expression programming, comprising:

[0044] Step S1. Randomly divide the training data set X into equal-sized labeled data sets X l and a dataset without class labels X u .

[0045] Step S2, for an n-category classification problem, never contain a class label data set X u Randomly select n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for classifying N central points based on gene expression programming and belongs to the technical field of machine learning. The method includes the steps of firstly, partitioning to-be-classified training data into a classification-marker-contained data set and a classification-marker-free data set; secondly, searching classification central points from a multi-dimensional space of the classification-marker-free data set according to a distance formula; thirdly, assessing accuracy of the searched classification central points on the classification-marker-contained data set; fourthly, using a gene expression programming method for expressing, searching and updating the classification central points to obtain new classification central points; fifthly, classifying unknown points according to the distances between the unknown points and the new classification central points; sixthly, repeating the third step, the fourth step and the fifth step until an objective function satisfies stop conditions. By the aid of the method for classifying the N central points based on gene expression programming, calculated quantity can be decreased while susceptibility to unbalanced data is avoided.

Description

technical field [0001] The invention relates to an N-centroid classification method based on gene expression programming, which belongs to the technical field of machine learning. Background technique [0002] Distance-based classification methods represent each datum as a numeric vector, constructing a class center for each class. When classifying, calculate the distance between the data to be classified and each class center point, and the category of the data to be classified is the class represented by the nearest class center point. [0003] The k-nearest neighbor classification method is a commonly used distance-based classification method. It avoids the difficulty of directly constructing the class center, and finds k data points closest to the data to be classified by calculation, and the category of the data to be classified is the category with the largest number of k data. The k-nearest neighbor method is a commonly used distance-based classification method, but...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2246G06F16/285
Inventor 李曲
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products