Unlock instant, AI-driven research and patent intelligence for your innovation.

A Hierarchical Clustering Method for Tourist Destination Data

A technology of data grading and clustering method, applied in instrument, calculation, character and pattern recognition, etc., to achieve the effect of improving accuracy, reducing complexity, and simplifying algorithms

Active Publication Date: 2021-10-29
ZHEJIANG UNIV OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Aiming at the defect that multi-scale data is mostly limited to space, the present invention uses multi-level analysis to propose a new distance-based multi-level clustering algorithm according to actual needs, and uses multi-level fast non-convex clustering to solve multi-scale density data. clustering problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Hierarchical Clustering Method for Tourist Destination Data
  • A Hierarchical Clustering Method for Tourist Destination Data
  • A Hierarchical Clustering Method for Tourist Destination Data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to clarify the purpose, technical solutions and advantages of the present invention, the present invention will be further described in detail below in conjunction with specific embodiments and accompanying drawings.

[0049] refer to Figure 1 to Figure 6 , a hierarchical clustering method based on region growth and competition for variable-scale data density space, including the following steps:

[0050] The first level: update the cluster center by drawing a circle with the distance threshold R1, the process is as follows:

[0051] Step 1.1: Input a set of unlabeled data sets X={x 1 ,x 2 ,...x i ,...x N}∈R P , where x represents the sample points in the data set, P represents the sample dimension, N represents the number of samples, randomly select the i-th data object x from X i , as the first cluster center point and stored in the set C={}; then randomly select the jth data object x in X j , use formula (1) to calculate x i x j Euclidean distance...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A hierarchical clustering method for tourist destination data based on regional growth and competition in a variable-scale data density space is different from the previous method. The present invention adopts the idea of ​​hierarchical clustering and divides the clustering process into three levels. The first-level clustering uses a distance threshold R1 based on Euclidean distance to divide objects into a certain number of sub-classes to simplify the algorithm and reduce the complexity. Then the second stage uses the method of spatial data region growth, using the obtained cluster centers as growth seeds, and grows under the growth criterion until the stopping condition is reached to solve the problem of variable-scale data density clustering. The last third level is based on the idea of ​​competition and the principle of density similarity, calculates the weight between the cluster centers, and adopts appropriate rules to merge the clusters to solve the problem of non-convex data clustering. Compared with other clustering algorithms, the method of the invention can greatly improve the accuracy of clustering on the basis of reducing the complexity, has obvious advantages in processing massive data, and can better meet the needs of practical engineering applications.

Description

technical field [0001] The invention relates to the field of hierarchical clustering, and specifically uses a method based on region growth and competition to improve the clustering method of variable-scale density data. Background technique [0002] Data mining is a hot issue in the field of artificial intelligence and database research. Cluster analysis is an important branch of data mining. As a tool for data analysis, clustering has been widely used in various fields. Clustering is the process of dividing a physical or abstract collection into classes consisting of similar objects. Clustering originated from taxonomy, but it is different from classification. The difference between clustering and classification is that the classes required for clustering are unknown and unsupervised. Clustering algorithms are roughly divided into (1) partition-based methods, such as K-means algorithm, K-medoids algorithm, etc.; (2) hierarchical-based methods, such as BIRCH algorithm, CU...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/2321
Inventor 何熊熊袁志琴庄华亮
Owner ZHEJIANG UNIV OF TECH