Data compression method, data decompression method, data compression system, and related devices for graph data

A compression method and data compression technology, applied in the direction of file system, file system function, special data processing application, etc., can solve the problems of large decoding logic and time-consuming, and difficulty in copying information of multiple nodes in parallel, so as to increase processing bandwidth Effect

Active Publication Date: 2020-04-21
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the length of column nodes is randomly distributed, it will be very difficult to replicate multiple node information in parallel on hardware
For example, in a graph data with a parallel processing capacity of 128 nodes, the input row nodes correspond to several column nodes, and copying at this time requires huge decoding logic and time-consuming

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data compression method, data decompression method, data compression system, and related devices for graph data
  • Data compression method, data decompression method, data compression system, and related devices for graph data
  • Data compression method, data decompression method, data compression system, and related devices for graph data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0039] Please refer to figure 1 , figure 1 It is a flow chart of a compression method for a graph computing system provided by an embodiment of the present application. The method includes

[0040] S101: Obtain original graph data and determine node pairs of each edge in the original graph data;

[0041] The original graph data exists in the COO fo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a graph data compression method. The method comprises the steps of obtaining original graph data and determining a node pair of each edge in the original graph data; grouping the same source nodes in the node pairs according to a preset grouping rule, wherein one source node is reserved in each group; storing the data into the corresponding FIFO according to the group type of each group; adding labels to the source node data of the source node and the destination node data of the destination node; and polling and storing the FIFO to obtain compressed graph data. According to the method, different compression ratios are adopted for the source nodes with different numbers corresponding to the destination nodes, only one source node is reserved in the same group, and repeated source nodes do not need to be stored; and large-scale parallel decompression of the graph data can be supported, and the processing bandwidth of the graph data is greatly increased. The invention further provides a data compression system of a graph calculation system, a computer readable storage medium and a data compression terminal, which have the above beneficial effects.

Description

technical field [0001] The present application relates to the field of data storage, and in particular to a method for compressing and decompressing image data, a compression system and related devices. Background technique [0002] In the era of big data, a large amount of data interacts with each other to form a large-scale graph, which is widely used in various algorithms such as PageRank. At present, the scale of the graph often reaches the order of tens of millions to hundreds of millions of nodes, and the edge (connection between nodes) information of the graph reaches the scale of billions. Usually, Internet graph data is in the form of a sparse matrix, and the storage methods that can be used include COO and CSR. COO uses the triplet format of <source node, destination node, weight>. Considering that a node is represented by 4 bytes, a Edges require 12 bytes of storage. CSR considers that the same row node has the same address, and only stores the same row of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/17
CPCG06F16/1744
Inventor 梅国强郝锐王江为阚宏伟
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products