Unlock instant, AI-driven research and patent intelligence for your innovation.

A clustering result semantic feature extraction and visualization method based on a strong item set

A technology of semantic features and clustering, applied in the direction of instruments, character and pattern recognition, computer components, etc. Semantic features of class results and other issues to achieve the effects of enhanced interpretability, high execution performance, and easy understanding

Active Publication Date: 2019-04-26
NORTHEASTERN UNIV LIAONING
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, there are some patents related to the interpretation and visualization of clustering results. Patent 201010194391.7 proposes a method for visualizing clustering analysis results, which realizes the clustering results of data information objects, the structural relationship between clustering categories and Its association and the semantic similarity between data information objects can be visually expressed, but it cannot reflect the semantic characteristics of each cluster in the clustering result set; a Radviz-based fuzzy clustering result visualization method proposed in patent 201610341872.3 is mainly based on Radviz with multi-dimensional matrix The form realizes the visualization of the distribution of membership, the size of clusters, the relationship between clusters, etc., and lacks the visualization in clustering semantic information; patent 201810255690.3 proposes a feedback clustering based on cluster semantic feature analysis Class method, which mainly uses K-means clustering method, does not design clustering result interpretation and visualization technology for any clustering method
Due to the lack of modeling and analysis of the semantic feature information of the clustering results of the general clustering method, the above patents cannot effectively describe the semantic information of each cluster in the clustering result set and enhance the interpretability of the clustering result set. Difficulties in recognizing and understanding clustering result sets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A clustering result semantic feature extraction and visualization method based on a strong item set
  • A clustering result semantic feature extraction and visualization method based on a strong item set
  • A clustering result semantic feature extraction and visualization method based on a strong item set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] An embodiment of the present invention will be further described below in conjunction with the accompanying drawings.

[0060] The example of the present invention takes the Breast-Cancer-Wisconsin data set in UCI as the research object, and this data set has 699 examples altogether; 10 attributes (sample number, clot thickness, cell size uniformity, cell shape uniformity, edge stickiness) Adhesion, single epithelial cell size, naked nucleus, plain chromatin, normal nucleolus, mitosis, the values ​​are all integers from 1 to 10); the cluster label is Class (the value is 2 (“benign (benign) ”) and 4 (“malignant”)).

[0061] A semantic feature extraction and visualization method for clustering results based on strong itemsets, the flow chart of the method is as follows figure 1 shown, including the following steps:

[0062] Step 1. Cluster semantic feature modeling based on strong itemsets;

[0063] In the embodiment of the present invention, the Breast-Cancer-Wisconsi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of computer information processing, and provides a clustering result semantic feature extraction and visualization method based on a strong item set. According to the method, firstly, a cluster semantic feature model based on a strong item set is constructed through analysis, the feature of each cluster can be described visually and effectively, and theinterpretability of a clustering result set is improved; Then a cluster semantic feature extraction algorithm CLCE based on a strong item set is provided, the algorithm has high performance, and cluster semantic features oriented to the strong class set can be effectively extracted; And finally, a cluster semantic feature visualization method is provided, a domain expert is further helped to understand each cluster in the clustering result set, and the application of the mined related knowledge model is promoted.

Description

technical field [0001] The invention belongs to the technical field of computer information processing, and proposes a semantic feature extraction and visualization method of a clustering result based on a strong item set. Background technique [0002] Clustering is one of the widely used technologies in the field of data analysis. It is a common method to analyze data with the idea of ​​"like flock together" without pre-specifying categories. However, the result of clustering—the interpretability of the clustering result set is the key to the success of clustering analysis methods at the application level. Therefore, for many clustering applications, the interpretation and visualization of clustering results is more important than the clustering itself. Although there are many existing clustering models, many new clustering research results have appeared in many aspects such as basic clustering algorithms, semi-supervised clustering, time series clustering, graph or networ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23
Inventor 张明卫何秀秀肖云龙季子其
Owner NORTHEASTERN UNIV LIAONING