An offline optimal cache replacement device and method for data recovery of a deduplicated system

A cache replacement technology for backup systems, applied in fields such as redundancy-based data error detection, memory systems, and electronic digital data processing. It addresses the problems that existing methods cannot effectively counter the impact of fragmentation on recovery performance and incur high computing overhead. It avoids that computing overhead, reduces recovery time, and improves recovery performance.

Active Publication Date: 2018-12-11
JINAN UNIVERSITY

AI Technical Summary

Problems solved by technology

Because logically contiguous data is physically dispersed across different disk locations, the fragmentation produced by traditional deduplication methods seriously degrades the recovery performance of the system.
Some existing optimization methods try to improve recovery performance by using an optimized cache replacement strategy during recovery. However, these methods either have a low hit rate or incur high additional computing overhead, and cannot effectively counter the impact of fragmentation on recovery performance.

Examples

Embodiment

[0030] As shown in Figure 1, the structure of the present invention includes the following parts: ① access sequence file, ② offline optimal cache replacement module, ③ replaced container, ④ metadata storage module, ⑤ data recovery module, and ⑥ recovery data storage module. The system mainly consists of two modules, ② and ⑤. Module ② simulates data recovery using the optimal cache replacement strategy during the system's idle period, and provides a reliable cache replacement strategy for the later real data recovery process. The main work of module ⑤ is to complete the data recovery using the information provided by the offline optimal cache replacement module ②.
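The division of labor between modules ② and ⑤ can be sketched roughly as follows. This is a minimal illustration, not the patented implementation. It assumes that the "optimal" strategy is the classic farthest-in-future rule (Belady's algorithm), which the text above does not name explicitly. The function names `plan_evictions` and `restore`, the `read_container` callback, and the sample sequence are all hypothetical.

```python
from collections import defaultdict, deque

def plan_evictions(access_seq, cache_size):
    """Offline step (module 2, assumed): simulate a restore and record,
    for every cache miss, which container to evict (None = free slot)."""
    # Queue of future access positions for each container ID.
    future = defaultdict(deque)
    for pos, cid in enumerate(access_seq):
        future[cid].append(pos)

    def next_use(c):
        # Position of the next access, or infinity if never used again.
        # The container ID is a tie-breaker so the plan is deterministic.
        return (future[c][0] if future[c] else float("inf"), c)

    cache, plan = set(), []
    for cid in access_seq:
        future[cid].popleft()          # consume the current access
        if cid in cache:
            continue                   # hit: nothing to record
        victim = None
        if len(cache) >= cache_size:
            victim = max(cache, key=next_use)  # farthest next use
            cache.remove(victim)
        cache.add(cid)
        plan.append(victim)
    return plan

def restore(access_seq, plan, read_container):
    """Online step (module 5, assumed): replay the precomputed plan,
    so no replacement decision is computed during the real restore."""
    cache, reads, victims = {}, 0, iter(plan)
    for cid in access_seq:
        if cid not in cache:
            victim = next(victims)
            if victim is not None:
                del cache[victim]
            cache[cid] = read_container(cid)   # one container read from disk
            reads += 1
    return reads

# Hypothetical access sequence of container IDs and a 2-container cache.
access_seq = [1, 2, 3, 1, 2, 4, 1, 2]
plan = plan_evictions(access_seq, cache_size=2)
reads = restore(access_seq, plan, read_container=lambda cid: f"container-{cid}")
```

In this sketch the eviction plan plays the role of the "replaced container" information ③: it is computed once while the system is idle, and the restore path only consumes it.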

[0031] The access sequence file sequentially records the ID number of the container to which each data block belongs during backup. The system generates the access sequence file at the same time as the data backup; that is, the generation of this file will not affect the performance of data reco...
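As a rough illustration of what such a file might hold, the sketch below writes one container ID per data block, in backup order, and reads it back. The one-ID-per-line text format and the names `record_access_sequence` and `load_access_sequence` are assumptions made for this example. The source does not specify the on-disk format.

```python
import io

def record_access_sequence(chunk_container_ids, out):
    """Append one container ID per line, in the order chunks were backed up."""
    count = 0
    for cid in chunk_container_ids:
        out.write(f"{cid}\n")
        count += 1
    return count

def load_access_sequence(src):
    """Read the container IDs back in their original order."""
    return [int(line) for line in src if line.strip()]

# Hypothetical backup: five data blocks stored in containers 3, 3, 7, 3, 12.
buf = io.StringIO()                       # stands in for the real file
written = record_access_sequence([3, 3, 7, 3, 12], buf)
buf.seek(0)
sequence = load_access_sequence(buf)
```

Repeated IDs are kept on purpose: the order and frequency of container accesses is exactly what an offline replacement calculation needs.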

Abstract

The invention discloses an offline optimal cache replacement device and method for data recovery in a deduplication system. Because logically contiguous data is physically scattered across different disk locations, the fragmentation produced by traditional deduplication methods seriously degrades system recovery performance. Some existing optimization methods attempt to improve recovery performance with an optimized cache replacement strategy. However, when data locality is weak, these methods either achieve a low cache hit rate or incur large extra computational overhead from calculating the optimal replacement order online, so they cannot effectively counter the impact of fragmentation on recovery performance. To solve this problem, the invention calculates the optimal cache replacement strategy offline. Without sacrificing the system's deduplication rate and without additional computational overhead, the optimal replacement order can be obtained, effectively improving recovery performance and throughput. The offline optimal cache replacement strategy can effectively shorten the required file recovery time and meet the requirements of modern data storage.
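The hit-rate gap described in the abstract can be seen on a small synthetic trace. The sketch below counts cache misses for LRU and for a farthest-in-future ("optimal") policy on the same container access sequence. LRU is used here only as a familiar stand-in for a locality-based policy, and the cyclic trace is a hypothetical worst case for it. Neither is taken from the source.

```python
from collections import OrderedDict

def lru_misses(seq, size):
    """Count misses for a least-recently-used container cache."""
    cache, misses = OrderedDict(), 0
    for cid in seq:
        if cid in cache:
            cache.move_to_end(cid)         # mark as most recently used
        else:
            misses += 1
            if len(cache) >= size:
                cache.popitem(last=False)  # evict least recently used
            cache[cid] = True
    return misses

def opt_misses(seq, size):
    """Count misses when evicting the container whose next use is farthest."""
    # next_use[i] = position of the next access to seq[i], or infinity.
    next_use, last_seen = [float("inf")] * len(seq), {}
    for i in range(len(seq) - 1, -1, -1):
        next_use[i] = last_seen.get(seq[i], float("inf"))
        last_seen[seq[i]] = i
    cache, misses = {}, 0                  # container ID -> next use position
    for i, cid in enumerate(seq):
        if cid not in cache:
            misses += 1
            if len(cache) >= size:
                del cache[max(cache, key=cache.get)]
        cache[cid] = next_use[i]           # refresh on hit and on insert
    return misses

# Hypothetical weak-locality trace: four containers scanned cyclically
# through a cache that holds only three of them.
trace = [1, 2, 3, 4] * 3
lru = lru_misses(trace, size=3)
opt = opt_misses(trace, size=3)
```

The optimal policy needs the whole future sequence, which is why computing it online is costly and why precomputing it from a recorded access sequence is attractive.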

Description

Technical field

[0001] The present invention relates to the technical field of data recovery in deduplication systems, and in particular to an offline optimal cache replacement device and method for data recovery in a deduplication backup system.

Background

[0002] With the advent of the era of big data, the rapid growth of data volume has brought great challenges to the limited storage space of data centers. Data deduplication technology greatly reduces the disk overhead required for data storage and the bandwidth required for network transmission, and has gradually become a key data reduction technology for today's backup systems. The purpose of a deduplication backup system is to store data so that it can be recovered in a timely manner in the future. If an enterprise fails to recover data in time when a disaster occurs, such as a disk failure or the corruption of database-related files, the losses suffered will be immeasurable, and the data backup itself will have served little purpose...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/14, G06F12/123
CPC: G06F11/1469, G06F12/123
Inventors: 邓玉辉, 杨儒
Owner: JINAN UNIVERSITY