Data storage method and system suitable for social network graph

A data storage technology for social network graphs, applied in the field of cloud storage. It addresses three problems: migration cost degrades system performance, existing partitioning algorithms cannot be used in real-time systems, and social graphs have a high average vertex degree. The effect is to reduce the number of requests and the communication cost and to speed up request processing.

Inactive Publication Date: 2013-10-09
PEKING UNIV
Cites: 4 · Cited by: 29

AI Technical Summary

Problems solved by technology

These algorithms are problematic. First, social network graphs differ from general graphs: they have a high average vertex degree, tight connectivity, and a small diameter, and they follow a power-law distribution. Research has shown that such graphs are difficult to partition and that the resulting partition quality is similar to that of simple hashing, so these algorithms cannot be used in real-time systems.
Second, in a production environment the social network changes frequently, and these algorithms have to re-partition the whole graph every time, which cannot meet the real-time requirements of production.
Third, these algorithms are sensitive to their input. Small changes in the graph structure cause large changes in the partitioning result, and the resulting migration cost severely degrades system performance, which is unacceptable in real-time systems.


Examples

Embodiment Construction

[0039] The present invention will be described in detail below through specific embodiments and accompanying drawings.

[0040] The data storage method for social network graphs of the present invention uses a partitioning strategy based on edge placement, which meets the system's requirements for real-time, incremental partitioning. In this strategy, a vertex v can be split into at most k sub-vertices $\{v'_1, v'_2, \ldots, v'_k\}$, where k is configurable in the system and is usually 2 to 5 depending on the size of the cluster; a vertex v may also be left unsplit. These sub-vertices are distributed across different partitions, and each sub-vertex holds all the vertex data of vertex v and the data on its corresponding edges, and satisfies:

[0041]

[0042] $\bigcup_{i=1}^{k} \pi(v'_i)$ ...
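The vertex-splitting rule in [0040] can be illustrated with a small data structure. This is only a sketch under stated assumptions: the class names `SubVertex` and `SplitVertex`, the `add_edge` method, and the default `k = 3` are hypothetical and do not come from the patent text. It shows the three properties the paragraph states: at most k sub-vertices per vertex, one sub-vertex per partition, and a full copy of the vertex data in every sub-vertex.

```python
from dataclasses import dataclass, field


@dataclass
class SubVertex:
    partition: int                             # partition that stores this sub-vertex
    vertex_data: dict                          # full copy of the data of the original vertex
    edges: set = field(default_factory=set)    # neighbors whose edges are stored here


@dataclass
class SplitVertex:
    vertex_id: str
    vertex_data: dict
    k: int = 3                                 # assumed default; the text says k is usually 2 to 5
    subs: dict = field(default_factory=dict)   # partition id -> SubVertex

    def add_edge(self, partition: int, neighbor: str) -> bool:
        """Store the edge (vertex_id, neighbor) in `partition`.

        Returns False if this would require more than k sub-vertices.
        """
        sub = self.subs.get(partition)
        if sub is None:
            if len(self.subs) >= self.k:
                return False                   # split limit reached, caller must pick another partition
            sub = SubVertex(partition, dict(self.vertex_data))
            self.subs[partition] = sub
        sub.edges.add(neighbor)
        return True
```

A vertex that is never split simply ends up with a single entry in `subs`.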

Abstract

The invention relates to a data storage method and system suitable for a social network graph. The method includes the following steps: the graph structure information of the social network graph is read in as an edge list or an adjacency list; a vertex is split into several sub-vertices, each of which holds all the vertex data of the vertex and the data on its corresponding edges; each edge in the graph that has not yet been assigned to a partition is first tentatively assigned to a single partition, in which case no cross-partition edge is generated; if that assignment does not succeed, a cross-partition edge is generated. The system comprises several management nodes, several worker nodes, and a ZooKeeper cluster. Only one management node is in the working state at any given time, and each worker node comprises a PHT index, a graph structure store, a request processor, and a database interface. The method and system can incrementally reflect changes in the structure of the social network graph, reduce the communication cost among servers, increase the overall throughput of the servers, and improve user experience.
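The edge-assignment step in the abstract can be sketched as a greedy, incremental loop. The code below is an illustration, not the patented algorithm: the function name `place_edges`, the tie-breaking rule (fewest new sub-vertices, then least-loaded partition), and the choice of where a cross-partition edge is stored are all assumptions. It also ignores any load-balance or capacity constraint, which a real deployment would need.

```python
from collections import defaultdict


def place_edges(edges, num_partitions, k=3):
    """Assign each edge to one partition, splitting vertices into at most k sub-vertices.

    Returns (placement, replicas, cut_edges), where cut_edges lists the
    edges that could not be placed without spanning two partitions.
    """
    replicas = defaultdict(set)     # vertex -> partitions holding one of its sub-vertices
    load = [0] * num_partitions     # number of edges stored per partition
    placement = {}                  # edge -> partition that stores it
    cut_edges = []                  # edges that span partitions

    def can_host(vertex, p):
        # A partition can host the vertex if a sub-vertex is already there
        # or if the vertex may still be split further.
        return p in replicas[vertex] or len(replicas[vertex]) < k

    for u, v in edges:
        candidates = [p for p in range(num_partitions)
                      if can_host(u, p) and can_host(v, p)]
        if candidates:
            # Assumed heuristic: create as few new sub-vertices as possible,
            # then prefer the least-loaded partition.
            best = min(candidates,
                       key=lambda p: ((p not in replicas[u]) + (p not in replicas[v]),
                                      load[p]))
            replicas[u].add(best)
            replicas[v].add(best)
        else:
            # Both endpoints are fully split with no shared partition:
            # record a cross-partition edge, stored alongside u (assumed).
            best = min(replicas[u], key=lambda p: load[p])
            cut_edges.append((u, v))
        placement[(u, v)] = best
        load[best] += 1
    return placement, dict(replicas), cut_edges
```

Because edges are processed one at a time and earlier placements are never revisited, a new friendship can be placed without re-partitioning the whole graph, which is the incremental property the abstract emphasizes.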

Description

Technical field

[0001] The invention belongs to the field of cloud storage and specifically relates to a data storage method and system suitable for social network graphs.

Background technique

[0002] With the continuous development of social networks such as Renren.com, Facebook, and Weibo, more and more UGC (User Generated Content) data is posted to social networks by users, and the data in social networks is growing explosively, which poses serious challenges to Internet companies.

[0003] To cope with this data growth and to meet fault-tolerance and load-balancing needs, existing website architectures use traditional distributed technology: they scale horizontally and use simple hashing, consistent hashing, or DHT technology to distribute data across multiple servers. The advantage of this data placement method is that it is uniform, consistent, and easy to maintain, but it ignores the data access patterns of the social network.

[0004] Friend relationships ...
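The hash-based placement described in [0003] can be made concrete with a short sketch. The helper names `hash_partition` and `cross_server_fraction` are hypothetical, and CRC32 is used only as a stand-in for whatever hash a real site uses. The second function measures how many friend edges end up with their endpoints on different servers, which is one way to see what "ignoring the access pattern" costs.

```python
import zlib


def hash_partition(user_id: str, num_servers: int) -> int:
    """Simple hash placement: the server depends only on the user id."""
    return zlib.crc32(user_id.encode("utf-8")) % num_servers


def cross_server_fraction(friend_edges, num_servers: int) -> float:
    """Fraction of friend edges whose two endpoints land on different servers."""
    if not friend_edges:
        return 0.0
    crossing = sum(
        1 for u, v in friend_edges
        if hash_partition(u, num_servers) != hash_partition(v, num_servers)
    )
    return crossing / len(friend_edges)
```

Since the hash looks only at the user id, friends are scattered independently of the graph structure, so a request that touches a user's friends will typically contact several servers.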

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 曲直, 薛继龙, 杨智, 代亚非
Owner: PEKING UNIV