Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An offline optimal cache replacement device and method for data recovery of deduplication backup system

A technology for backing up systems and data, applied to redundancy in computing for data error detection, memory systems, electrical digital data processing, etc. High overhead, etc., to avoid computing overhead, reduce recovery time, and improve recovery performance

Active Publication Date: 2021-11-12
JINAN UNIVERSITY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the logically continuous data is physically dispersed in different disk locations, the fragmentation generated by the traditional deduplication method seriously affects the recovery performance of the system
Some existing optimization methods try to use the optimized cache replacement strategy to improve recovery performance during recovery. However, these methods either have a low hit rate or high additional computing overhead, and cannot effectively resist the impact of fragmentation on recovery performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An offline optimal cache replacement device and method for data recovery of deduplication backup system
  • An offline optimal cache replacement device and method for data recovery of deduplication backup system
  • An offline optimal cache replacement device and method for data recovery of deduplication backup system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0030] Such as figure 1 As shown, the structure of the present invention includes the following parts: 1. access sequence file, 2. off-line optimal cache replacement module, 3. replaced container, 4. metadata storage module, 5. data recovery module, 6. recovery data storage module, the system mainly It consists of two modules ② and ⑤. Module ② simulates data recovery by using the optimal cache replacement strategy during the idle period of the system, and provides a reliable recovery cache replacement strategy for the later real data recovery process. The main work of module ⑤ is to use the offline optimal cache replacement module ② to provide information to complete the data recovery.

[0031] An access sequence file, which sequentially records the ID number of the container to which each data block belongs during backup. The system generates access sequence files at the same time as data backup, that is, the generation of this file will not affect the performance of data r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an off-line optimal cache replacement device and method for data restoration in a deduplication backup system. Because the logically continuous data is physically dispersed in different disk locations, the fragmentation generated by the traditional deduplication method seriously affects the performance of system recovery. At present, some optimization methods try to improve the recovery performance by using the optimized cache replacement strategy during recovery. However, these methods are in When the data locality is not strong, the hit rate of the cache is low or the extra calculation cost of calculating the optimal replacement order online is large, resulting in the inability to effectively resist the impact of fragmentation on recovery performance. To solve the above problems, the present invention starts from offline On the other hand, without sacrificing the deduplication rate of the system and without additional computing overhead, the optimal replacement order can be obtained to effectively improve the recovery performance and throughput, and the offline optimal cache replacement Policies can effectively optimize the recovery time of required files and meet the requirements of modern data storage.

Description

technical field [0001] The present invention relates to the technical field of data recovery of a deduplication system, in particular to an off-line optimal cache replacement device and method for data recovery of a deduplication backup system. Background technique [0002] With the advent of the era of big data, the rapid growth of data volume has brought great challenges to the limited storage space of data centers. Data deduplication technology greatly reduces the disk overhead required for data storage and the bandwidth required for network transmission, and has gradually become a key data reduction technology for today's backup systems. The purpose of deduplication backup system is to store data for timely recovery in the future. If an enterprise fails to recover data in time when a disaster occurs such as disk failure or database-related file corruption, the losses suffered will be immeasurable, and data backup will also become incalculable. It doesn't make much sense...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/14G06F12/123
CPCG06F11/1469G06F12/123
Inventor 邓玉辉杨儒
Owner JINAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products