Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Expansive entropy information bottleneck principle based clustering method

An information bottleneck and clustering method technology, applied in the field of data mining, can solve problems such as unsupervised clustering analysis

Inactive Publication Date: 2010-02-24
SHANDONG COMP SCI CENTNAT SUPERCOMP CENT IN JINAN
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since there is no prior target as a reference, this problem is an unsupervised clustering analysis problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Expansive entropy information bottleneck principle based clustering method
  • Expansive entropy information bottleneck principle based clustering method
  • Expansive entropy information bottleneck principle based clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to better understand the present invention, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0026] Clustering is the process of dividing a data set into several classes by analyzing the correlation between variables, so that the differences within a class are small and the differences between classes are large. The most important thing in the clustering process is how to measure the correlation between data sets, and to give clustering an objective clustering standard.

[0027] The present invention proposes a clustering method based on the extended entropy information bottleneck theory, and the specific operation method of the method is introduced as follows:

[0028] information bottleneck theory

[0029] First introduce the information bottleneck theory. Given a target set, the bottleneck-based clustering method seeks to minimize the information loss between target classes an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an expansive entropy information bottleneck principle based clustering method, which mainly aims at the problem of unsupervised clustering in data mining and the problem of array set clustering when complicated relationship exists among clustered arrays but cannot be described by statistic probability. The method not only can indicate the relevance among the arrays, but also can indicate the correspondence among positions corresponding to the arrays, and can provide an objective cluster truncation rule so as to effectively avoid the defect of designating the clusteringnumber subjectively. The clustering method can be widely applied to clustering problems in the fields of medicine, intelligent transportation, mode identification and the like.

Description

technical field [0001] The invention relates to the field of data mining, in particular to cluster analysis. Background technique [0002] Cluster analysis is an important research content in the field of data mining, which aims to divide the data set into several classes, so that the differences within the class are small and the differences between classes are large. The distance between data is usually used to describe the similarity, the larger the distance, the smaller the similarity, and vice versa. Cluster analysis is widely used in pattern recognition, data analysis, image processing, intelligent transportation, medicine and other fields. Many effective clustering methods have been formed, such as K-means method, k-center method, self-organizing neural network, Bayesian neural network, Fisher clustering and so on. Different clustering methods have different starting points, which are mainly reflected in the measurement of the distance between samples or variables. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 孙占全
Owner SHANDONG COMP SCI CENTNAT SUPERCOMP CENT IN JINAN
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products