Distributed support vector clustering method and system

A support vector clustering and support vector technology, applied in the field of data processing, can solve the problems of storage space consumption, increasing algorithm complexity, reducing scale, etc.

Inactive Publication Date: 2015-03-11
XUCHANG UNIV
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the biggest disadvantage of this method is the poor performance, the huge consumption of storage space by the kernel matrix built on the entire training set, and the expensive time cost when solving the support function describing the hypersphere.
[0005] The existing methods for improving the efficiency of support vector clustering analysis mainly include 1) conversion to solve the dual problem of the support function, but although it is beneficial to the calcul

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed support vector clustering method and system
  • Distributed support vector clustering method and system
  • Distributed support vector clustering method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The purpose of the present invention is to provide a method for distributed support vector clustering, which can effectively improve the storage efficiency and time efficiency of the support vector clustering method, and another purpose of the present invention is to provide a method for support vector clustering system.

[0057] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0058] The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed support vector clustering method which comprises the following steps: processing the input data set according to a preset processing rule, and initializing global parameters and tasks; distributing a preset data set or a specific calculation result to each computational node; initializing a weight vector of the preset data set when the preset data set is distributed to the computational node; performing iterative operation according to a preset formula, and calculating a weight coefficient value of each sample in the preset data set; finding a sample of which the weight coefficient value is higher than a preset minimal value to serve as a support vector, and numbering the support vector; and building a support function by utilizing the support vector and the weight coefficient of each support vector, performing cluster division to obtain a cluster label of the support vector, and calibrating a non-support vector sample as a clustering analysis result. According to the method, the support vector clustering efficiency can be effectively improved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a distributed support vector clustering method and system. Background technique [0002] Clustering analysis is to find the observation value clusters with close relationship in the object set through some similarity measure, so that the similarity between the objects in the cluster is as large as possible, and the similarity between the objects of different clusters is as large as possible. Potentially small, even different or irrelevant. [0003] At present, the advantages and disadvantages of clustering analysis methods are carried out by measuring effectiveness and realizing performance, namely time efficiency and storage efficiency. [0004] The support vector clustering method is better for effectiveness. Among them, support vector clustering is a kind of method based on kernel function. It maps data from input space to high-dimensional feature space by using kerne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/90335
Inventor 平源李慧娜张志立张永杨月华马慧
Owner XUCHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products