Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Density peak-based big data mining method and apparatus

A density peak and big data technology, applied in the field of big data mining methods and devices based on density peaks, can solve problems such as slow convergence speed and sensitive initial cluster centers.

Inactive Publication Date: 2017-05-10
GUANGDONG UNIV OF TECH
View PDF0 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the fuzzy C-means clustering algorithm has some shortcomings, mainly in being sensitive to the initial cluster center, easy to converge to local optimum, slow convergence speed, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Density peak-based big data mining method and apparatus
  • Density peak-based big data mining method and apparatus
  • Density peak-based big data mining method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The embodiment of the present invention provides a density peak-based big data mining method and device, which does not need to specify the number of clusters and accelerates the convergence speed.

[0049] In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the following The described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0050] see figure 1 , an embodiment of a density peak-based big data mining method provided by an embodiment of the present inventio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the invention disclose a density peak-based big data mining method and apparatus. The method comprises a first stage of selecting initial clustering centers by using a density peak clustering algorithm, and a second stage of determining an initial clustering number and accelerating convergence of a conventional fuzzy C-mean algorithm. In the algorithm of the first stage, a decision graph is provided by using density peak clustering, the initial clustering centers are selected, and the initial clustering center number is namely a clustering center number after the selection is finished, so that the clustering number does not need to be manually specified. In the algorithm of the second stage, an iterative process of the conventional fuzzy C-mean algorithm is optimized, the influence of local density on the algorithm is considered, and a density weighting factor is added, so that the algorithm can obtain a global optimal solution in an accelerated way; and an oscillating factor is added, so that the algorithm convergence can be accelerated.

Description

technical field [0001] The invention relates to the field of data mining, in particular to a density peak-based big data mining method and device. Background technique [0002] Clustering is to divide all the objects in the sample space into several groups, so that the objects in the same group have a high similarity, while the objects in different groups have great differences. Among them, fuzzy C-means clustering, as a typical representative of fuzzy clustering, has a wide range of applications. However, the fuzzy C-means clustering algorithm has some disadvantages, mainly in being sensitive to the initial cluster center, easy to converge to local optimum, and slow in convergence speed. [0003] Density peak algorithm (Clustering by fast search and find of density peaks, CFSFDP) is a heuristic algorithm based on density clustering. Density peak clustering is based on the assumption that for a data set, the cluster center is surrounded by some data points with low local d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/285G06F16/2465
Inventor 许青林刘沧生熊梦琪姜文超
Owner GUANGDONG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products