Unlock instant, AI-driven research and patent intelligence for your innovation.

A virtual machine snapshot backup method and system based on multi-layer deduplication

A virtual machine and snapshot technology, applied in the computer field, can solve the problems of one-time backup, the incompatibility of cloud computing platforms, and the inability to cope with the backup requirements of virtual machine clusters, and achieve the effect of maximizing resources

Active Publication Date: 2017-05-24
ALIBABA GRP HLDG LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Amazon's technical solution is based entirely on the data modification records of a single virtual machine to determine which data needs to be backed up. Its weakness lies in: first, even if the data in the block is only modified by one byte, the entire block of data must be backed up once
Secondly, when different users back up the same data, such as the operating system and various commonly used software, due to different user behaviors, the disk location of the data cannot be unified, and this method cannot detect this type of duplicate data at all.
[0005] Although EMC's technical solution can eliminate duplicate backup data on a global scale based on data content characteristics, its dedicated storage server is extremely expensive and cannot meet the PB-level backup requirements of virtual machine clusters.
Such solutions are incompatible with cloud computing platforms characterized by cheap and massive data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A virtual machine snapshot backup method and system based on multi-layer deduplication
  • A virtual machine snapshot backup method and system based on multi-layer deduplication
  • A virtual machine snapshot backup method and system based on multi-layer deduplication

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The main idea of ​​the present invention is to divide the virtual machine snapshot into multiple sub-data blocks, and divide each sub-data block into multiple data segments; perform multi-layer deduplication on the virtual machine snapshot, and the multi-layer deduplication operation includes : Perform deduplication of sub-data blocks, data fragments and public data sets in sequence on the virtual machine snapshot, so as to exclude data in the virtual machine snapshot that will cause repeated backups, wherein the public data is centrally stored Backup and store data fragments with a repetition rate higher than a predetermined threshold in the file system; and store the remaining virtual machine snapshot data after multi-layer deduplication processing.

[0027] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present disclosure provides an example method and system for virtual machine backup based on multilayer de-duplication. A virtual machine snapshot is divided into multiple child data blocks. Each child data block is divided into multiple data segments. Multilayer de-duplication is applied to the virtual machine snapshot to exclude data causing duplicate backup in the virtual machine snapshot. The remaining virtual machine snapshot data after the processing of the multilayer de-duplication is stored.

Description

technical field [0001] The invention relates to the field of computers, in particular to a virtual machine snapshot backup method and system based on multi-layer deduplication. Background technique [0002] At present, general virtual machine systems provide users with system snapshot services, that is, full snapshot backup of virtual machine disk images. The virtual machine snapshot backup system is a subsystem of the virtual machine system, which manages all historical data of PB-level virtual machine users. Therefore, improving the storage efficiency of the snapshot backup system has a very important impact on reducing the user's virtual machine usage cost and improving the cluster's storage usage efficiency. In order to be able to process users' backup data requests in real time and on a large scale while efficiently eliminating redundant data, the virtual machine snapshot backup system needs to meet at least three conditions: high data processing speed, for example, it...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/14
CPCG06F11/1453G06F2201/815G06F2201/84G06F16/11G06F16/1748G06F11/1451G06F11/1464G06F11/1466G06F16/128G06F2201/80G06F2201/805G06F2201/82
Inventor 张为唐洪蒋灏曾月李小刚
Owner ALIBABA GRP HLDG LTD