Operation system based on hierarchical large-scale graph data

A computing system for graph data, applied in data processing applications, electrical digital data processing, special data processing applications, etc. It addresses the problem that different segmentation rules produce different edge data blocks, which lowers data accuracy.

Active Publication Date: 2018-04-20
合肥亚慕信息科技有限公司

AI Technical Summary

Problems solved by technology

For example, in GraphChi (a single-machine graph computing and processing platform), graph computation is centered on the destination node, so the computer sorts the graph data by destination node ID (identification) in ascending order. The edge data in GraphChi is divided into multiple edge data blocks (called shards in GraphChi), and all edge data corresponding to the same destination node is placed in one edge data block. However, different segmentation rules produce different edge data blocks, so the data obtained in the final merge is less accurate.
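The destination-centered sharding described above can be sketched as follows. This is a minimal illustration and not GraphChi's actual code: the `Edge` tuple, the `make_shards` helper, and the `max_edges_per_shard` limit are hypothetical names introduced for the example. It shows how two different segmentation rules (here, two size limits) yield different edge data blocks from the same edges.

```python
from collections import namedtuple
from itertools import groupby

# Hypothetical edge record: source node ID, destination node ID, edge weight.
Edge = namedtuple("Edge", ["src", "dst", "weight"])

def make_shards(edges, max_edges_per_shard):
    """Sort edges by destination ID and pack them into shards.

    All edges that share a destination node stay in the same shard,
    even if that makes a shard exceed the size limit.
    """
    ordered = sorted(edges, key=lambda e: e.dst)  # ascending destination ID
    shards, current = [], []
    for _, group in groupby(ordered, key=lambda e: e.dst):
        group = list(group)
        # Start a new shard if this destination's edges would not fit.
        if current and len(current) + len(group) > max_edges_per_shard:
            shards.append(current)
            current = []
        current.extend(group)
    if current:
        shards.append(current)
    return shards

edges = [Edge(1, 3, 0.5), Edge(2, 1, 1.0), Edge(3, 1, 2.0),
         Edge(1, 2, 0.7), Edge(4, 3, 0.1)]

# Two different segmentation rules give two different sets of shards.
shards_a = make_shards(edges, max_edges_per_shard=2)
shards_b = make_shards(edges, max_edges_per_shard=3)
print(len(shards_a), len(shards_b))
```

Because the shard boundaries depend on the rule, any later merge step has to tolerate shards of different shapes, which is the accuracy concern the text raises.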



Examples


Embodiment Construction

[0017] As shown in Figure 1, a computing system based on hierarchical large-scale graph data includes a graph data acquisition unit, a graph data analysis unit, and a graph data management unit.

[0018] The graph data acquisition unit collects large-scale graph data, applies median filtering to remove noise from the graph data, and then transmits the processed graph data to the graph data analysis unit and the graph data management unit.
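The noise-filtering step in [0018] might look like the sketch below. The source does not say which values are filtered or what window is used, so applying a sliding-window median to a sequence of edge weights, the window size of 3, and the `median_filter` helper name are all assumptions made for illustration.

```python
from statistics import median

def median_filter(values, window=3):
    """Replace each value with the median of its neighborhood.

    The window is truncated at both ends of the sequence, so edge
    positions use fewer samples (an assumption, not from the source).
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    n = len(values)
    filtered = []
    for i in range(n):
        lo = max(0, i - half)
        hi = min(n, i + half + 1)
        filtered.append(median(values[lo:hi]))
    return filtered

# Hypothetical edge weights with one noisy spike (9.0).
weights = [1.0, 1.1, 9.0, 1.2, 1.0]
smoothed = median_filter(weights)
print(smoothed)
```

A median filter is a reasonable fit for isolated spikes because a single outlier never becomes the median of its window, whereas a mean filter would smear it across its neighbors.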

[0019] The graph data analysis unit splits the preprocessed graph data into different sub-data according to rules and distributes the sub-data to the corresponding computing nodes. It then collects statistics on the results calculated by each computing node and merges the statistical results. The data calculated by each computing node and the merged data are transmitted to the graph data management unit. The graph data analysis unit includes a graph data segmentation module, a statistics module, and a graph data mergin...
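The split, distribute, and merge flow in [0019] can be sketched as below. The paragraph is truncated in the source, so every concrete choice here is an assumption: the split rule (destination ID modulo the number of parts), the statistic computed on each node (in-degree per destination), and the use of a thread pool as a stand-in for separate computing nodes. The helper names `split_edges`, `in_degree`, and `merge` are hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

def split_edges(edges, num_parts):
    """Rule-based split (assumed rule): edge goes to part dst % num_parts."""
    parts = [[] for _ in range(num_parts)]
    for src, dst, weight in edges:
        parts[dst % num_parts].append((src, dst, weight))
    return parts

def in_degree(part):
    """Work done on one 'computing node': count incoming edges per node."""
    return Counter(dst for _, dst, _ in part)

def merge(results):
    """Merge the per-node statistics into one result."""
    total = Counter()
    for result in results:
        total.update(result)
    return total

# Edges as (source, destination, weight) tuples.
edges = [(1, 3, 0.5), (2, 1, 1.0), (3, 1, 2.0), (1, 2, 0.7), (4, 3, 0.1)]

parts = split_edges(edges, num_parts=2)
with ThreadPoolExecutor(max_workers=2) as pool:  # stand-in for computing nodes
    per_node = list(pool.map(in_degree, parts))
merged = merge(per_node)
print(dict(merged))
```

Splitting by destination keeps all edges for one node on one worker, so the merged counts equal the counts computed on the unsplit data; a rule that scattered a node's edges would still merge correctly here only because counting is additive.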


Abstract

The invention discloses an operation system based on hierarchical large-scale graph data. The system comprises a graph data collection unit, a graph data analysis unit and a graph data management unit. The graph data analysis unit comprises a graph data partitioning module, a statistical module and a graph data merging module; the graph data management unit comprises an operation module, a comparison module and an alarm module. The graph data collection unit collects the large-scale graph data, performs noise filtering on the graph data through median filtering, and then transmits the processed graph data to the graph data analysis unit and the graph data management unit. After preprocessing, the graph data is partitioned according to its adjacent nodes, and the partitioned graph data is then integrated. Boundary node collection is performed on the preprocessed graph data to obtain an original boundary, the original boundary is compared with the integrated data, and the precision of the partitioned data is judged, thereby ensuring the accuracy of the graph data.
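One way to read the boundary comparison in the abstract is sketched below. The abstract does not define "boundary node" or how precision is scored, so this example assumes that a boundary node is any node with an edge crossing between partitions, and that the score is the fraction of original boundary nodes still present after integration. The names `boundary_nodes`, `check_merge`, and `part_of` are hypothetical.

```python
def boundary_nodes(edges, part_of):
    """Nodes with at least one edge whose endpoints lie in different parts."""
    nodes = set()
    for src, dst, _ in edges:
        if part_of[src] != part_of[dst]:
            nodes.update((src, dst))
    return nodes

def check_merge(original_edges, merged_edges, part_of):
    """Compare the original boundary with the boundary of the merged data.

    Returns (match_ratio, missing_nodes). A ratio below 1.0 is where an
    alarm would be raised in this sketch.
    """
    expected = boundary_nodes(original_edges, part_of)
    found = boundary_nodes(merged_edges, part_of)
    missing = expected - found
    ratio = 1.0 if not expected else len(expected & found) / len(expected)
    return ratio, missing

# Nodes 1-3 are in part 0, nodes 4-6 in part 1 (assumed partitioning).
part_of = {1: 0, 2: 0, 3: 0, 4: 1, 5: 1, 6: 1}
original = [(1, 2, 1.0), (2, 3, 1.0), (3, 4, 1.0), (4, 5, 1.0), (5, 6, 1.0)]

# A faulty merge that dropped the cross-partition edge (3, 4).
faulty = [e for e in original if (e[0], e[1]) != (3, 4)]

ok_ratio, ok_missing = check_merge(original, original, part_of)
bad_ratio, bad_missing = check_merge(original, faulty, part_of)
print(ok_ratio, bad_ratio, sorted(bad_missing))
```

Checking only the boundary is cheaper than comparing every edge, and it targets the edges most likely to be lost when sub-data is processed on separate nodes.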

Description

Technical field

[0001] The invention belongs to the field of large-scale graph data processing, and relates to a computing system based on hierarchical large-scale graph data.

Background technique

[0002] In the era of big data mining, graphs can directly describe many practical applications in the fields of computer science, chemistry, and bioinformatics, such as social networks, web graphs, chemical substances, and biological structures, and can also be used to describe various data mining algorithms, such as matrix factorization or shortest path. A graph includes a plurality of nodes and the edges connecting them. The graph data includes the node data of each node and the edge data of the edges connecting the nodes, and the edge data of an edge includes the source node, the destination node, and the weight of the edge. In a stand-alone graph computing processing platform (that is, a processing platform that uses a single compu...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06K9/62, G06Q50/00
CPC: G06F16/9024, G06Q50/01, G06F18/22
Inventor: 姚伟强, 周基初, 张宇, 郑凯
Owner: 合肥亚慕信息科技有限公司