Spectral clustering algorithm parallelization method in abnormal data detection and system
A technology of abnormal data detection and spectral clustering algorithm, which is applied in the direction of digital data information retrieval, file system, file system type, etc., can solve the problem of low execution efficiency of spectral clustering algorithm and the inability of stand-alone storage system to meet the storage requirements of massive data, etc. problem, to achieve low latency, reduce difficulty, and improve computing efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0033] In order to facilitate the understanding of the present invention, the present invention will be understood in connection with the accompanying drawings and examples, and the embodiments described herein are intended to illustrate and explain the present invention. this invention.
[0034] Please see figure 1 An abnormal data detection method provided by the present invention is parallelized by the spectrum clustering algorithm, including the following steps:
[0035] Step 1: Data sets of data to be clustered by data distributed storage;
[0036] In this embodiment, a data set sample to be clustered is divided into several data blocks, and these data blocks are abstracted into RDD objects, and these RDDs are assigned to several working nodes in the Spark cluster for storage, deposit open source distribution. File system HDFS.
[0037] Please see figure 2 The detailed process of data distributed storage is displayed. HDFS contains a NameNode and several DataNode (data nodes)...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


