Concurrent job backup method for mass data backup

A mass data backup technology, applied in fields such as redundancy in operation for data error detection, digital data processing, and response error generation. It addresses problems such as single points of failure in the backup system, and achieves reduced management burden, reduced data theft, and consistent configuration.

Active Publication Date: 2018-11-13
INST OF HIGH ENERGY PHYSICS CHINESE ACAD OF SCI
5 Cites, 18 Cited by

AI Technical Summary

Problems solved by technology

1. For massive data containing tens of millions or even billions of files, a backup takes days or even weeks, so the backup task cannot be completed within an acceptable time frame.
[0007] 2. The backup system may have a single point of failure. Once a server fails, the backup and recovery services defined on that server cannot continue.
[0008] 3. The backup software adopts a custom storage format for safety reasons, so the backup files depend on the backup software. When the software fails, the backup files cannot be used, and having a backup is no better than having none.

Examples

Embodiment Construction

[0033] The present invention will be further described below in conjunction with specific examples.

[0034] Take a backup of the /home directory on a machine named login as an example. The machine named bak01 is the server responsible for the backup. It first starts the self-check process of the selfserver service, which checks that the machine's status is normal, that the configuration related to /home can be obtained, and that each backup process exists. It then starts the backup daemon backup_agent for /home. backup_agent runs the checkconf script, which reads the backup policy defined for the /home directory and returns the required parameters:

[0035] 1. Backup source directory: login:/home;

[0036] 2. Backup frequency: run a backup every day;

[0037] 3. Backup level: 0 (a full backup is used for this backup);

[0038] 4. Access level: private (non-public; only the root user on the login machine can recover data);

[0039] 5. Storage directory: bak01:/gluster/daily/...
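The five parameters above can be pictured as a small policy record. The following is a minimal sketch in Python, not the patent's implementation: the class name `BackupPolicy`, its field names, and the helper `may_restore` are hypothetical, and the rule for access levels other than private is an assumption. The storage path is kept truncated exactly as it appears in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BackupPolicy:
    """Hypothetical container for the parameters returned by checkconf."""
    source: str      # "host:path" of the directory to back up
    frequency: str   # how often the backup runs
    level: int       # 0 means a full backup
    access: str      # "private" limits recovery to root on the source host
    storage: str     # "host:path" where backups are written

    def source_host(self) -> str:
        # Split "login:/home" at the first colon and keep the host part.
        return self.source.split(":", 1)[0]

    def source_path(self) -> str:
        # Split "login:/home" at the first colon and keep the path part.
        return self.source.split(":", 1)[1]

    def is_full_backup(self) -> bool:
        return self.level == 0


def may_restore(policy: BackupPolicy, user: str, host: str) -> bool:
    """Private backups are restorable only by root on the source host."""
    if policy.access == "private":
        return user == "root" and host == policy.source_host()
    return True  # assumption: any other access level is treated as public


home_policy = BackupPolicy(
    source="login:/home",
    frequency="daily",
    level=0,
    access="private",
    storage="bak01:/gluster/daily/...",  # path is truncated in the source text
)
```

With this record, `may_restore(home_policy, "root", "login")` is true, while a non-root user, or root on another machine such as bak01, is refused.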

Abstract

The invention discloses a concurrent job backup method for mass data backup. The method comprises the following steps: 1) selecting multiple backup nodes to form a backup cluster, wherein the backup nodes have uniform configuration; 2) a terminal selecting a backup node to serve as a backup management server and starting the backup strategy of an object needing backup; 3) the backup management server selecting a backup node to serve as a job scheduler, obtaining the directory structure corresponding to the object needing backup layer by layer, and generating a scanning job each time a directory is obtained; 4) the backup management server submitting the job path corresponding to each scanning job to the job scheduler, and the job scheduler sending the job paths to the backup nodes so as to scan the target directories in the scanning jobs; 5) the backup management server selecting the files needing backup, generating a plurality of file sub-tables, generating a copy job according to each sub-table, and sending the copy jobs to the job scheduler; and 6) the job scheduler sending different copy jobs to different backup nodes so as to copy the files needing backup to the corresponding positions.
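Steps 3 to 6 can be illustrated on a single machine. The sketch below is an assumption-laden stand-in, not the patented system: a `ThreadPoolExecutor` plays the role of the job scheduler, its worker threads play the role of backup nodes, and every file is treated as needing backup (a full backup). All function names and the `subtable_size` parameter are hypothetical.

```python
import os
import shutil
from concurrent.futures import ThreadPoolExecutor


def scan_directory(path):
    """One scan job: list a single directory, returning (subdirs, files)."""
    subdirs, files = [], []
    with os.scandir(path) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                subdirs.append(entry.path)
            elif entry.is_file(follow_symlinks=False):
                files.append(entry.path)
    return subdirs, files


def scan_tree(root, pool):
    """Walk the tree layer by layer, running one scan job per directory."""
    all_files, layer = [], [root]
    while layer:
        next_layer = []
        # Every directory in the current layer is scanned concurrently.
        for subdirs, files in pool.map(scan_directory, layer):
            next_layer.extend(subdirs)
            all_files.extend(files)
        layer = next_layer
    return sorted(all_files)


def split_into_subtables(files, size):
    """Cut the file list into sub-tables of at most `size` entries."""
    return [files[i:i + size] for i in range(0, len(files), size)]


def copy_job(subtable, src_root, dst_root):
    """One copy job: copy each file in a sub-table, keeping relative paths."""
    for src in subtable:
        dst = os.path.join(dst_root, os.path.relpath(src, src_root))
        os.makedirs(os.path.dirname(dst), exist_ok=True)
        shutil.copy2(src, dst)
    return len(subtable)


def backup(src_root, dst_root, workers=4, subtable_size=1000):
    """Scan, split into sub-tables, then run the copy jobs concurrently."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        files = scan_tree(src_root, pool)
        tables = split_into_subtables(files, subtable_size)
        futures = [pool.submit(copy_job, t, src_root, dst_root) for t in tables]
        return sum(f.result() for f in futures)
```

Calling `backup(src, dst)` returns the number of files copied. Splitting the file list into sub-tables is what lets the copy work spread across workers; in the method described, each sub-table would go to a different backup node rather than to a thread.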

Description

Technical field

[0001] The invention relates to a data backup method, in particular to a parallel job backup method for mass data backup.

Background art

[0002] Data is crucial to an enterprise, department, unit or individual. Once data is lost or destroyed, whether through equipment failure, hacking, viruses, or human error, the losses can be inestimable, which makes data backup very important. Data backup is a data security strategy: a copy of key data is made so that, when a failure occurs, the data can be restored through the backup software and the losses caused by data loss are avoided.

[0003] With the continuous development of information technology, emerging technologies such as cloud computing, the Internet of Things, and social networks have caused the type and scale of data in human society to explode on a global scale. As of 2012, the amount of data has jumped from TB (1TB=1024GB) level to PB (1PB=1024TB), EB (1EB=1024PB) and even ZB (1ZB...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/14
CPC: G06F11/1461
Inventor: 姚秋玲, 陈德清
Owner: INST OF HIGH ENERGY PHYSICS CHINESE ACAD OF SCI