Method and system for reallocating data in a file system

a file system and data technology, applied in the field of network-based storage systems, can solve the problems of limiting the number of read operations required, limiting the seek time required by the reading component, and limiting the availability of physical contiguous space for writing new files

Active Publication Date: 2008-08-05
NETWORK APPLIANCE INC
View PDF12 Cites 78 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Consequently, when the file is read from the disk or mass storage device, the individual data blocks can be read sequentially, thereby limiting the number of read operations required, which in turn limits the seek time required by the reading component (e.g., disk drive head) to search for and locate the individual data blocks comprising the file.
However, over time, as existing files are modified and deleted, and new files are written, the availability of physical contiguous space for writing new files becomes limited.
Consequently, files tend to become fragmented.
When the files of a file system become fragmented, multiple read operations are required to read the files, and the overall performance of the system suffers.
For a variety of reasons, fragmentation issues may arise with network-based storage systems that implement write anywhere file systems, such as the WAFL® (Write Anywhere File Layout) file system developed by Network Appliance of Sunnyvale, Calif.).
Consequently, with a write anywhere file system, certain classes of workloads are likely to endure performance degradation due to fragmentation issues.
For instance, performance levels associated with workloads that are characterized by a large number of random writes followed by a sequential read tend to suffer from fragmentation issues.
For example, during a work day, a large file that makes up an individual user's email mailbox may be modified several times, causing the large file to become severely fragmented.
However, due to the fragmentation caused by the user's activity throughout the day, the system may have a difficult time sequentially reading the mailbox file.
However, the defragmentation operation does not increase the number of allocated disk blocks.
Traditional defragmentation applications may be ineffective when used with certain network-based storage systems that implement write anywhere file systems.
In particular, the WAFL® file system has certain characteristics, which render traditional defragmentation tools less useful.
However, because data blocks “SN1”, “A”, “B”, “C” and “D” are part of the read-only Snapshot, those data blocks cannot be moved or rewritten.
Consequently, a traditional defragmentation application, which might move the data blocks of the active file system to a new location, would be ineffective because it would nearly double the number of allocated blocks.
Furthermore, in an environment with one or more network-based storage systems, there are often multiple file systems involved.
However, by improving the logical arrangement, the locally executed defragmentation application may cause greater fragmentation of the data blocks as physically stored on the mass storage device.
Consequently a defragmentation application executed locally on the client may actually make matters worse in certain instances.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for reallocating data in a file system
  • Method and system for reallocating data in a file system
  • Method and system for reallocating data in a file system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024]A method and system for reallocating the data blocks of a logical data entity are described. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be evident, however, to one skilled in the art that the present invention may be practiced without these specific details.

[0025]As described in greater detail below, in certain embodiments of the invention, a reallocation method includes analyzing the physical arrangement of the data blocks making up a particular logical data entity (e.g., a file, volume or logical unit number (LUN)) to determine a fragmentation level associated with the logical data entity. If the fragmentation level exceeds a fragmentation threshold that has previously been assigned to the logical data entity, then an attempt is made to physically rearrange the data blocks in order to reduce the overall level of fragmentation. Accordingly, at...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method and system for reallocating the data blocks of a logical data entity are described. According to one aspect of the invention, the physical arrangement of data blocks of a logical data entity are analyzed to determine a fragmentation level. Next the fragmentation level is compared with a fragmentation threshold previously assigned to the logical data entity. If the fragmentation level exceeds the fragmentation threshold, only those data blocks of the logical data entity that are within a predefined allocation area having a predetermined number of contiguous data blocks that are not associated with the logical data entity are reallocated.

Description

FIELD OF THE INVENTION[0001]The present invention relates generally to network-based storage systems, and in particular, to a method and system for reallocating data blocks of a logical data container, such as a file, volume or LUN, in an active file system of a network-based storage system with write anywhere capabilities.BACKGROUND[0002]In a typical modern file system, when a file is initially written to a disk or other mass storage device, the file is divided into equally-sized, logical data blocks, and an attempt is made to write the logical data blocks sequentially and contiguously to the disk or mass storage device. Consequently, when the file is read from the disk or mass storage device, the individual data blocks can be read sequentially, thereby limiting the number of read operations required, which in turn limits the seek time required by the reading component (e.g., disk drive head) to search for and locate the individual data blocks comprising the file. However, over tim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(United States)
IPC IPC(8): G06F12/00
CPCG06F3/061G06F3/0638G06F12/023G06F3/067G06F2206/1004
Inventor FAIR, ROBERT L.EDWARDS, JOHN K.
Owner NETWORK APPLIANCE INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products