Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Fragment removing method and system based on global statistics

A de-fragmentation and global technology, applied in the direction of responding to the generation of errors, redundant data in the calculation, etc., can solve problems such as decline, and achieve the effect of improving recovery performance

Active Publication Date: 2014-06-25
HUAZHONG UNIV OF SCI & TECH
View PDF6 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to propose a method of defragmentation for the problem that the recovery performance of the cloud backup system based on the data deduplication technology gradually decreases with the increase of the number of versions, that is, to find out the data fragments in the backup data stream, and store these Data fragmentation and new data are written into the segment to achieve the purpose of defragmentation and improve recovery performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fragment removing method and system based on global statistics
  • Fragment removing method and system based on global statistics
  • Fragment removing method and system based on global statistics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0050] In the cloud backup system based on data deduplication technology, the new version of duplicate data will unevenly reference the data blocks in the existing segments. Some segments have more referenced data, while some have less referenced data. If the amount of referenced data in the segment referenced by the duplicate data is small, the duplicate data is data fragmentation, which will s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a fragment removing method based on global statistics. The method includes the steps of determining all repeating data blocks in a data stream to be backed-up, counting the length of all quoted data in quoted sections corresponding to all the repeating data blocks, obtaining a section quotation buffer area, calculating the specific value of the length of all quoted data in quoted sections corresponding to all the repeating data blocks to the length of the quoted sections, judging whether the specific value is smaller than a set threshold value or not, and if yes, writing the repeating data bocks in the sections. The invention further provides a fragment removing system based on global statistics. The length of all quoted data in quoted sections corresponding to all the repeating data blocks is obtained, the section quotation rate of the quoted sections corresponding to all the repeating data blocks is calculated, the repeating data blocks corresponding to the data sections with the section quotation rate smaller than the set threshold value are judged to be data fragments, the data fragments are written in the sections, and the aim of removing the fragments so as to improving the restorability can be achieved.

Description

technical field [0001] The invention belongs to the technical field of computer information storage, and more specifically relates to a method and system for removing fragments based on global statistics, which are mainly used to remove data fragments in a cloud backup system based on data deduplication. Background technique [0002] Cloud backup system is a backup system for data centers that use third-party cloud storage services (such as Amazon S3 and Baidu Cloud Storage BCS) instead of traditional backup systems. The cloud backup system stores the data backed up by users in the third-party cloud, and the use of third-party cloud storage instead of the traditional data center has the advantages of low cost, strong scalability and high reliability. With the development of cloud storage, many backup systems and data synchronization tools that use third-party cloud storage to store data have emerged, and are becoming more and more popular. [0003] In order to improve data ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/14
Inventor 华宇冯丹赖荣誉夏文付忞黄方亭周玉坤张宇成
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products