A data backup method for cloud computing platform based on clustering

A cloud computing platform and data backup technology, applied in the field of cloud computing, can solve problems such as poor system and cluster load balancing capabilities, too much data redundancy, and no consideration of differences in different data, etc., to achieve the effect of improving storage performance

Inactive Publication Date: 2017-11-10
WUHAN UNIV OF TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This unified backup strategy does not take into account the differences between different data, which may lead to too much data redundancy in the system, relatively low storage efficiency, and poor system and cluster load balancing capabilities in practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data backup method for cloud computing platform based on clustering
  • A data backup method for cloud computing platform based on clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0027] like figure 1 As shown, the number and location of each data backup are different, and data 1-5 are the data in 5 different clusters in the result of clustering. Backup rules can be formulated for each type of data as required. For data 1 (such as data that is not used for a long time and is not important), its backup rule can be formulated to be backed up only once on the local node. For data 2 (such as data that is rarely used and basically not modified), in order to ensure security, its backup rule can be formulated as the number of backups is 2, and the backup location is on different nodes of the same rack. For data 3 (for example, data that has a certain amount of usage and usage time and will be modified), its backup rule can be formulated as the number of backups is 3, and two copies are placed on different nodes of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to a kind of cloud computing platform data backup method based on clustering, and this method comprises: (1) confirm key factor according to user's demand; (2) introduce association rule to discover the correlation of key factor, determine the number of divided clusters; 3) Clustering the data records containing key factors; (4) According to step (3) for each time period, formulate the number of data backups and the backup location in units of clusters. According to the different usage situations of different data, the method of the present invention formulates a specific backup strategy for the data including the number of backups and backup locations, effectively solving the problem that too much data redundancy in the system affects the load capacity of the system, thereby effectively improving the system or cluster. storage performance.

Description

technical field [0001] The invention relates to the field of cloud computing, in particular to a data backup method for a cloud computing platform based on clustering. Background technique [0002] The data backup strategy is a corresponding backup management strategy formulated for the data backup requirements of different backup nodes. It is a set of rules defined by the system administrator on the management server for managing data security backup, archiving and hierarchical storage. After formulating a data backup strategy, you can back up data of different types and purposes according to the backup rules in specified numbers and locations. [0003] The current backup strategy on the cloud platform uses HDFS for data replication by default. The HDFS distributed file system adopts a unified backup strategy of three copies, and the backup location is specified. HDFS places two copies on different nodes of the same rack. , another replica is placed on a node in a differen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/14H04L29/08
Inventor 钟珞杨光李琳唐琨皓
Owner WUHAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products