Method for determining optimum cluster number
A clustering number, the best technology, applied in the direction of text database clustering/classification, relational database, database model, etc., can solve the problems of poor calculation efficiency, difficult to accurately determine the k value of clustering number, limitations, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0047] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0048] The present invention uses a validity index Q(C) to evaluate the clustering effect of the data set. The validity index measures the quality of clustering mainly through the compactness of data objects within a class and the separation degree of data objects between classes. The related concepts are introduced below.
[0049] 1. Effectiveness indicators
[0050] Suppose for a cube DB, one of the clusters is divided into C k ={C 1 ,C 2 ,...,C k}. At this time, cluster C k The intra-class compactness of is obtained by calculating the sum of the squares of the distances between any two data objects in the same class, using Scat(C k )To represent:
[0051] Scat ( C k ) = Σ i = ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com