Network text data detection method based on fuzzy cluster
Patent Information
- Authority / Receiving Office
- CN ยท China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- SHAANXI DEVTEK TECH DEV
- Publication Date
- 2010-06-30
- Estimated Expiration
- Not applicable ยท inactive patent
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The invention relates to a data detection method, in particular to a network text data detection method. Background technique
[0002] About 80% of the information in the network is in the form of text, so the research on text data mining technology has become an increasingly popular and very important research topic in data mining. Web content clustering is a fully automatic processing process for grouping similar texts in web content into a group, and it is an unsupervised learning process. The purpose of clustering is to distinguish and classify physical or abstract objects according to the similarity between objects. According to the form of data division, the clustering method can be divided into: when there is a clear boundary in the division, it is called hard division, that is, the data is divided into a certain class; the division without clear boundaries is called fuzzy division, that is, the given data is divided into The form of the degre...