Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Entropy sequencing-based semi-supervision spectral clustering method for determining clustering number

A sorting method and clustering number technology, which is applied in the fields of instruments, character and pattern recognition, computer parts, etc., can solve the problems of poor big data results, manual setting of thresholds, failures, etc., and achieve the effect of improving the clustering effect.

Inactive Publication Date: 2011-02-09
XIDIAN UNIV
View PDF1 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The algorithm has achieved better results to a certain extent, but the results for complex data are not ideal.
[0009] Since the above methods of automatically determining the number of clusters all select the eigenvectors corresponding to the top k largest eigenvalues, they have the following disadvantages: 1. Susceptible to noise and cause clustering errors; 2. The results are not good or invalid for large data; 3. .Need to manually set the threshold

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entropy sequencing-based semi-supervision spectral clustering method for determining clustering number
  • Entropy sequencing-based semi-supervision spectral clustering method for determining clustering number
  • Entropy sequencing-based semi-supervision spectral clustering method for determining clustering number

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] refer to figure 1 , the specific realization of the present invention comprises the following steps:

[0048] Step 1, calculate the scale parameter σ of each point i and the affinity matrix A of the dataset.

[0049] 1a) Input image dataset X={x 1 , x 2 ,...,x n}∈R d , where x i Represents any point in the data set, i∈(1,n), n is the number of data, and d represents the dimension of the data;

[0050] 1b) Calculate the scale parameter σ of each point of the input image data X i :

[0051] σ i = 1 m Σ d = 1 m | | x i - x d | |

[0052] Among them, σ i Represents the scale parameter of any point in the data points, x d is any point x in the data class X i The dth...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an entropy sequencing-based semi-supervision spectral clustering method for determining a clustering number, and mainly solves the problem of selection of a characteristic vector of a Laplacian matrix in spectral clustering. The method comprises the following steps of: performing permutation on the characteristic vector by an entropy sequencing theory so as to acquire an array of the characteristic vector with the uppermost importance; for a k class problem, extracting the first k arrays of the characteristic vector and projecting the arrays into a k-dimensional space; clustering according to the distance of each point and 2k semiaxes in the k-dimensional space; recording the preserved clustering number as c except for a class without points in a 2k class or a clustering class with the point number less than one percent of that of input data; extracting the first c arrays of the characteristic vector and circulating the operation until the clustering number is stable, wherein corresponding class number is the optimal clustering number; and marking an input data point according to the coordinate of each input point so as to acquire a clustering result. The method has the advantages of high self-adaption performance and high clustering precision rate and can be used for self-adaptively determining an image category number.

Description

technical field [0001] The invention belongs to the technical field of image processing and relates to an image clustering method, which can be applied to the field of image clustering to determine the number of clusters adaptively. Background technique [0002] Image clustering is an important step in image processing. The purpose of image clustering is to cluster different regions on the image into different classes according to the relationship between image pixels. Spectral clustering is an emerging clustering method in recent years. The idea of ​​this algorithm originated from the theory of spectral graph partitioning, and it is regarded as a multi-way partitioning problem of an undirected graph. The reason why spectral clustering is superior to traditional clustering algorithms is that it is not limited by the shape of the sample space and converges to the global optimal solution. Therefore, spectral clustering algorithms have been widely used in the field of image cl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06K9/6224G06F18/2323
Inventor 张向荣焦李成杨杰侯彪王爽公茂果刘若辰李阳阳
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products