Parallelization clustering method for power communication data resources

A technology of data resources and power communication, applied in the field of power system, can solve the problems of the accuracy and efficiency of the clustering algorithm, the low efficiency of clustering data execution, affecting the execution time of the clustering algorithm, etc., so as to improve the clustering effect. , the effect of supporting efficient analysis and utilization, reducing the time of search assignments

Inactive Publication Date: 2018-11-09
GUANGDONG POWER GRID CO LTD +1
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The quality of the initial point selection in the clustering algorithm will have a huge impact on the accuracy and efficiency of the entire clustering algorithm. In addition, the resource allocation schemes of different parallel algorithms will also affect the execution time of the clustering algorithm.
In the analysis of existing patent literature, many clustering algorithms do not optimize the selection of initial points, which has a great impact on the mechanism and clustering speed; in addition, simple random sampling is performed on the original data to complete the data segmentation. The execution efficiency of data initialization is not high; there is also a hierarchical clustering method based on graph generation, which implements a parallel clustering method under the MapReduce parallel framework. The accuracy is not ideal and has certain limitations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallelization clustering method for power communication data resources
  • Parallelization clustering method for power communication data resources
  • Parallelization clustering method for power communication data resources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] Below in conjunction with the accompanying drawings, the present invention will be further described by way of examples. The accompanying drawings are only used for illustrative descriptions, and the accompanying drawings are only used for illustrative descriptions, and should not be construed as limitations on this patent.

[0057] like figure 1 As shown, a parallelized clustering method for power communication data resources is provided, and the method includes the following steps:

[0058] S1: Parallelize the point density of each sample point in the data set;

[0059] S2: Select k initial cluster centers according to the point density and the distance constraints between them;

[0060] S3: Use the k-medoids algorithm to parallelize the non-center point assignment and center point update on the Hadoop platform until the center point and non-center point of each cluster no longer change, and the final clustering result is obtained.

[0061] like figure 2 As shown,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of power systems and more specifically relates to a parallelization clustering method for power communication data resources. First, point density of different sample points in a data set is calculated in parallel; k initial clustering centers are selected according to the size of the point density and the mutual distance; finally, a k-medoids algorithmis utilized for non-center point distribution and center point update on a Hadoop platform in parallel until the center point and the non-center point of each cluster do not change any more and the final clustering result is obtained. According to the invention, the initial points of the clustering algorithm are selected through combination of the size of the point density calculated according tothe sample points and the mutual distance, the initial cluster center selection mechanism is improved, the clustering iteration operation effectiveness is improved, the search range is reduced and the clustering effect is improved; parallelization resource allocation is optimized, clustering time is shortened and clustering precision is improved, effective analysis and utilization of power data are well supported and the comprehensive benefits of power communication network related data are developed.

Description

technical field [0001] The invention belongs to the technical field of power systems, and more particularly, relates to a parallel clustering method for power communication data resources. Background technique [0002] With the continuous development of the power communication network with the function as the center, a large number of operation and maintenance management systems are produced which are divided into specialties, functions and management domains, which in turn leads to the generation of "island phenomenon" of power operation and maintenance data, which seriously inhibits a large number of scattered The comprehensive benefits of power communication operation and maintenance data are brought into play. How to use distributed systems to better handle these huge and complex power communication operation and maintenance data has become a hot research issue. As an effective means of data processing, cluster analysis supports the integration and classification of a l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06Q50/06
CPCG06Q50/06G06F18/23213
Inventor 李星南曾瑛林斌付佳佳施展吴赞红
Owner GUANGDONG POWER GRID CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products