Pruning mode-based DBSCAN block segmentation optimization method

An optimization method and block technology, applied in the field of DBSCAN algorithm optimization, can solve problems such as low correlation between data and inapplicability, achieve high anti-interference, improve efficiency, and improve clustering efficiency

Inactive Publication Date: 2018-07-31
SOUTH CHINA UNIV OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present invention is to overcome the low correlation between the data that solves the traditional DBSCAN algorithm of prior art and needs O(N 2 ) time complexity implementation algorithm, and it cannot be applied to the same data set with different cluster density requirements and the situation where two clusters with low correlation are associated together due to linear connection. A DBSCAN block optimization based on pruning method is proposed method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pruning mode-based DBSCAN block segmentation optimization method
  • Pruning mode-based DBSCAN block segmentation optimization method
  • Pruning mode-based DBSCAN block segmentation optimization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0049] This embodiment discloses a DBSCAN block optimization method based on pruning mode, using data pruning block algorithm, block data dynamic clustering algorithm, data block merging node calculation method and data block effective merging algorithm, according to the actual data The clustering parameters are dynamically adjusted according to the situation to effectively improve the clustering accuracy of the DBSCAN algorithm. At the same time, the efficiency of DBSCAN is greatly improved by effectively dividing the data into blocks. In addition, the algorithm optimization of the merging process of DBSCAN block data is used to effectively solve the problem of two clusters with low correlation. A clustering process in which classes are linked together by linear links.

[0050] Such as Figure 7 As shown, the method of this embodiment specifically includes the following steps:

[0051] (1) Data pruning and block algorithm:

[0052] 1. Read the data. By building a linked lis...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a pruning mode-based DBSCAN block segmentation optimization method. The method involves a data pruning block segmentation algorithm, a block data dynamic clustering algorithm,a data block combination node calculation method and a data block effective combination algorithm. The method comprises the main steps of calculating a dimension block quantity, traversing all dimensions from a dimension to perform block segmentation on data, and performing pruning operation on data blocks not meeting a continuous block segmentation requirement; dynamically calculating DBSCAN important parameters according to block data density, and dynamically realizing a block clustering process; judging whether two data blocks come from a same father data block or not through formula calculation, and performing a block combination process by taking the judgment as a standard; and performing combination algorithm implementation on data blocks needed to be combined, and solving a clustering process of associating two clusters with low association together due to linear connection by utilizing the combination process. Through the method, the clustering effect and efficiency of DBSCAN are improved; the DBSCAN is applicable to more diversified scenes; and the application of the DBSCAN to conventional fields and big data analysis is promoted.

Description

technical field [0001] The present invention relates to DBSCAN algorithm optimization technology, in particular to a pruning-based parallel accelerated DBSCAN algorithm implementation method, so that DBSCAN can be applied to a wider range of scenarios and promote the application of DBSCAN in traditional fields and big data analysis. Background technique [0002] In the information society, with the continuous development of information technology such as cloud computing and artificial intelligence, and the widespread application of big data technology, using Spark to solve data clustering problems and dig out the rich information contained in massive data has become an increasingly An important way to pursue personalized service. The DBSCAN algorithm is widely used in data mining. Using a density-based method, the high-density area is divided into clusters, and clusters of arbitrary shapes can be found while eliminating the influence of noise. However, the parallel implemen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2465G06F16/2246G06F16/2453G06F16/285
Inventor 何克晶蔡梓浩
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products