Continuous attribute discretization method based on Canopy clustering and BIRCH hierarchical clustering
A technology of hierarchical clustering and clustering method, applied in text database clustering/classification, structured data retrieval, unstructured text data retrieval, etc., can solve the problems of unreasonable discretization and poor discretization effect.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0050] In order to have a further understanding and understanding of the structural features of the present invention and the achieved effects, the preferred embodiments and accompanying drawings are used for a detailed description, as follows:
[0051] likefigure 1 As shown, a kind of continuous attribute discretization method based on Canopy clustering and BIRCH hierarchical clustering according to the present invention comprises the following steps:
[0052] The first step is to use Canopy clustering to achieve initial clustering of continuous attribute data. Set reasonable distance thresholds T1 and T2, where the thresholds T1 and T2 are the measures for dividing the size of Canopy. T1 determines the number of points contained in each Cluster, which directly affects the "center of gravity" and "radius" of the Cluster, while T2 determines If T2 is too large, there will be only one Cluster, and if it is too small, there will be too many Clusters. The specific determination o...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com