Method and device for optimizing data placement to reduce data fragments

A data placement optimization technology, applied in the fields of data error detection, electrical digital data processing, and special data processing applications, that achieves enhanced data redundancy locality.

Inactive Publication Date: 2013-03-27
CHONGQING UNIV
Cites: 5; Cited by: 23

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to reduce the non-sequential placement of data and data fragmentation, alleviate th



Examples


Detailed Description of the Embodiments

[0030] The present invention is further described below with reference to the accompanying drawings and embodiments:

[0031] The main entities involved in the present invention are a backup server and a storage server. The backup server provides the data to be backed up, and the storage server stores it. Duplicate data is looked up and deleted in the storage server.

[0032] Figure 1 is a flowchart of the method of the present invention for reducing data fragmentation by optimizing data placement; the process starts at S101.

[0033] In step S102, each file to be backed up is divided into data blocks, for example with a variable-length chunking algorithm, and the average data block size is taken as the nominal size of a data block to be backed up, for example 8 KB. A data block fingerprint is then calculated for each data block to be backed up. The algorithm of the data block fingerprint can ...
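The chunking and fingerprinting in step S102 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `chunk_variable_length` and `fingerprint`, the gear-style rolling hash, the minimum and maximum block sizes, and the choice of SHA-1 are all assumptions made for illustration, since the visible text names neither the chunking algorithm nor the fingerprint algorithm. Only the 8 KB average block size comes from the description.

```python
import hashlib
import random

# Hypothetical gear table: one fixed pseudo-random 32-bit value per byte value.
_rng = random.Random(0)
_GEAR = [_rng.getrandbits(32) for _ in range(256)]


def chunk_variable_length(data: bytes, avg_size: int = 8192,
                          min_size: int = 2048, max_size: int = 65536) -> list:
    """Split data into variable-length blocks at content-defined boundaries.

    avg_size must be a power of two; it sets how often the rolling hash
    matches the boundary condition, not an exact block size.
    """
    mask = avg_size - 1
    chunks = []
    start = 0
    h = 0
    for i, byte in enumerate(data):
        # Gear-style rolling hash: older bytes are shifted out of the low bits.
        h = ((h << 1) + _GEAR[byte]) & 0xFFFFFFFF
        length = i - start + 1
        at_boundary = length >= min_size and (h & mask) == 0
        if at_boundary or length >= max_size:
            chunks.append(data[start:i + 1])
            start = i + 1
            h = 0
    if start < len(data):
        chunks.append(data[start:])  # trailing block may be shorter than min_size
    return chunks


def fingerprint(block: bytes) -> str:
    """Block fingerprint; SHA-1 is an assumed choice, not taken from the source."""
    return hashlib.sha1(block).hexdigest()
```

Because boundaries depend on content rather than on byte offsets, inserting a few bytes near the start of a file shifts only the nearby block boundaries, so most later blocks keep the same fingerprints.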


Abstract

The invention relates to a method and a device for reducing data fragmentation by optimizing data placement. The method comprises the following steps: dividing each file to be backed up into data blocks and determining a data block fingerprint for each data block to be backed up; organizing a plurality of consecutive data blocks to be backed up into a data segment to be backed up; for each data block in the data segment to be backed up, searching the backed-up data segments in the system for an identical data block; if none is found, judging the data block to be a non-duplicate data block and proceeding to the data read/write step; if one is found, judging the data block to be a duplicate data block and proceeding to the next step; calculating and quantifying the data redundancy locality between the data segment to be backed up and the backed-up data segment; if the value of the data redundancy locality is smaller than a preset threshold, proceeding to the data read/write step, and otherwise proceeding to the next step; and deleting from the data segment to be backed up the duplicate data blocks that it shares with the backed-up data segment. The method reduces non-sequential data placement and data fragmentation, slows the growth of fragmentation at the cost of a small loss in data compression ratio, and improves the read and write performance of the system.
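The decision flow in the abstract can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the patented method: the function name `plan_segment`, the fingerprint index that maps a fingerprint to a stored segment id, the threshold value of 0.5, and above all the way redundancy locality is quantified (the fraction of the incoming segment's blocks that are found in one backed-up segment) are hypothetical, because the visible text does not give the formula.

```python
from collections import Counter


def plan_segment(segment, index, threshold=0.5):
    """Decide, per block, whether to write it or remove it as a duplicate.

    segment:   list of block fingerprints in the data segment to be backed up
    index:     dict mapping fingerprint -> id of the backed-up segment holding it
    threshold: assumed preset threshold on the redundancy locality value

    Returns (to_write, to_dedup), both in the segment's original block order.
    """
    # How many incoming blocks each backed-up segment shares with this segment.
    shared = Counter(index[fp] for fp in segment if fp in index)

    to_write, to_dedup = [], []
    for fp in segment:
        stored_id = index.get(fp)
        if stored_id is None:
            to_write.append(fp)  # non-duplicate block: go to the read/write step
            continue
        # Assumed quantification: share of this segment found in stored_id.
        locality = shared[stored_id] / len(segment)
        if locality < threshold:
            to_write.append(fp)  # weak locality: write again to keep data sequential
        else:
            to_dedup.append(fp)  # strong locality: delete the shared duplicate
    return to_write, to_dedup
```

For example, with `index = {"a": 1, "b": 1, "c": 1, "x": 2}` and the segment `["a", "b", "c", "x", "n"]`, blocks `a`, `b`, and `c` share a locality of 0.6 with stored segment 1 and are deduplicated, while `x` (locality 0.2) is written again alongside the new block `n`. Writing `x` again is the small loss in compression ratio that the abstract accepts in exchange for less fragmentation.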

Description

Technical field

[0001] The invention belongs to the technical field of computer information storage, and in particular relates to a method and a device for reducing data fragmentation by optimizing data placement.

Background art

[0002] Data deduplication is an advanced lossless data compression technology, mainly used to save the storage space required in information storage and backup systems. Its basic principle is to cut each file into multiple consecutive data blocks and to delete duplicate data blocks within a single file or across multiple files, reducing the storage space required. Most existing information storage and backup systems use this technology to optimize storage space and reduce data storage and management costs.

[0003] In an information storage and backup system that uses deduplication technology (referred to as a deduplication system for short), there are mainly two types of data blocks. One is new data blocks that ne...


Application Information

IPC(8): G06F17/30, G06F11/14
Inventor: 谭玉娟, 沙行勉, 晏志超, 诸葛晴凤, 刘铎
Owner: CHONGQING UNIV