Supercharge Your Innovation With Domain-Expert AI Agents!

An Ensemble Method for Selective Text Clustering Based on Spectral Graph Theory

A text clustering and integration method technology, applied in the field of text clustering, can solve the problems of low clustering accuracy and low robustness, and achieve the effects of improving accuracy, improving robustness, and reducing computing time

Active Publication Date: 2022-08-02
YANCHENG INST OF TECH +1
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the above-mentioned single clustering methods can only perform text clustering in the case of a specific size, shape, and noise clusters, and their robustness is low, which leads to the problem of low final clustering accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Ensemble Method for Selective Text Clustering Based on Spectral Graph Theory
  • An Ensemble Method for Selective Text Clustering Based on Spectral Graph Theory
  • An Ensemble Method for Selective Text Clustering Based on Spectral Graph Theory

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0073] The preferred embodiments of the present invention will be described below with reference to the accompanying drawings. It should be understood that the preferred embodiments described herein are only used to illustrate and explain the present invention, but not to limit the present invention.

[0074] The embodiment of the present invention provides a selective text clustering integration method based on spectral graph theory, please refer to Figure 1-Figure 8 ,like figure 1 As shown, the method includes the following steps:

[0075] S100, using the K-means algorithm to generate cluster members from the text data set;

[0076] S200, using a spectral clustering algorithm to select representative members from the generated cluster members;

[0077] S300, using a hierarchical clustering method to integrate the selected representative members;

[0078] S400, the integrated representative members constitute the clustering result of the article.

[0079] The working pri...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a selective text clustering integration method based on spectral graph theory. The text data set adopts K-means algorithm to generate cluster members; and the spectral clustering algorithm is used to select representative members from the generated cluster members; The selected representative members are integrated by hierarchical clustering method; the integrated representative members constitute the clustering results of this paper. It solves the problem of large amount of calculation caused when the spectral clustering method is directly applied to high-dimensional, sparse, and massive text data sets. Therefore, the use of this scheme significantly reduces the calculation time of text clustering and effectively improves the performance of text clustering. accuracy. In addition, in this embodiment, the K-means algorithm is used as the base clusterer to randomly select the initial centroids, the algorithm complexity is low, and the robustness of the algorithm is improved.

Description

technical field [0001] The invention relates to the technical field of text clustering, in particular to a selective text clustering integration method based on spectral graph theory. Background technique [0002] With the continuous development of the era of big data, text information is produced in large quantities and is rich in value. How to rationally use this text information has become an opportunity and adjustment for people. As one of the important means of text data mining, text clustering is to classify a large amount of disorganized text information reasonably through similarity judgment. Since text clustering does not require a training process, it has become an effective way to organize and summarize text information. and an important means of navigation. [0003] At the same time, clustering, as an unsupervised machine learning method with a high degree of automation, has been widely used in information retrieval, automatic multi-document summarization and ot...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06K9/62G06F40/242G06F40/289
Inventor 徐森陈明权徐秀芳花小朋皋军安晶王江峰嵇宏伟姜陈雨陆湘文
Owner YANCHENG INST OF TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More