Methods and apparatus for content-defined node splitting

Content-defined node splitting, applied in special data processing applications and file access structures, addresses the problem that nodes created at different split points do not contain the same items and therefore are not exact duplicates.

Status: Inactive; Publication Date: 2010-03-31
NEC LAB AMERICA

AI Technical Summary

Problems solved by technology

Nodes created at different split points do not contain the same items, so they are not exact duplicates across the two data structures.



Examples


Embodiment Construction

[0017] Content Addressable Storage (CAS) systems store information that can be retrieved based on its content rather than its location. FIG. 1 is a diagram of a storage system 100. In at least one embodiment of the present invention, the method for node splitting described herein is performed in a storage system such as the storage system 100. Implementations of such storage systems are described in more detail in related U.S. Patent Application No. 12/042,777, entitled "System and Method for Content-Addressable Storage," filed March 5, 2008, which is incorporated herein by reference.
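The idea of retrieving by content rather than location can be illustrated with a minimal sketch. It is not the storage system 100 or the system of the referenced application; the `BlockStore` class, its `put`/`get` methods, and the choice of SHA-256 as the content hash are all assumptions made for illustration.

```python
import hashlib


class BlockStore:
    """Toy content-addressable store (hypothetical, for illustration only).

    A block's address is the hash of its bytes, so identical content
    always maps to the same address and is stored only once.
    """

    def __init__(self):
        self._blocks = {}

    def put(self, data: bytes) -> str:
        # Assumption: SHA-256 serves as the content hash.
        address = hashlib.sha256(data).hexdigest()
        # Storing the same content twice is a no-op.
        self._blocks.setdefault(address, data)
        return address

    def get(self, address: str) -> bytes:
        return self._blocks[address]

    def __len__(self) -> int:
        return len(self._blocks)


store = BlockStore()
addr_a = store.put(b"hello")
addr_b = store.put(b"hello")  # duplicate content, same address
addr_c = store.put(b"world")
```

Because the address depends only on the bytes, two writers that produce identical blocks end up sharing one stored copy, which is why exact-duplicate nodes matter for deduplication.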

[0018] The storage system 100 includes a file server 102, which receives data operations (e.g., file writes and file reads) and metadata operations (e.g., file removal) and chunks the received data into data blocks to be stored in the block store 104. The block store 104 stores data and metadata blocks, some of which can point to other blocks, wh...


Abstract

A region of a node is searched to find a content-defined split point. The split point of a node is determined based at least in part on hashes of the entries in the node, and the node is split at the determined split point. The search region is searched for the first encountered split point, and the node is split at that point. An entry qualifies as a split point when its hash, under a predetermined bitmask, satisfies a predetermined condition.
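The search described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed method: the names `entry_hash`, `is_split_point`, `find_split_point`, and `split_node` are hypothetical, SHA-256 stands in for the unspecified hash, the mask value `0x3` is arbitrary, the "predetermined condition" is assumed to be "masked bits are all zero", and splitting at the end of the region when no entry qualifies is an assumed fallback that the abstract does not specify.

```python
import hashlib

MASK = 0x3  # hypothetical bitmask: examine the two low-order hash bits


def entry_hash(entry: bytes) -> int:
    # Assumption: SHA-256 truncated to 64 bits stands in for the entry hash.
    return int.from_bytes(hashlib.sha256(entry).digest()[:8], "big")


def is_split_point(entry: bytes, mask: int = MASK) -> bool:
    # Assumed condition: all masked bits of the hash are zero.
    return entry_hash(entry) & mask == 0


def find_split_point(entries, start, end, mask=MASK):
    """Return the index of the first qualifying entry in entries[start:end],
    or None if no entry in the search region qualifies."""
    for i in range(start, min(end, len(entries))):
        if is_split_point(entries[i], mask):
            return i
    return None


def split_node(entries, start, end, mask=MASK):
    """Split a node's entries just after the first split point in the region."""
    i = find_split_point(entries, start, end, mask)
    if i is None:
        # Assumed fallback: split at the last entry of the search region.
        i = min(end, len(entries)) - 1
    return entries[: i + 1], entries[i + 1 :]


entries = [f"chunk-{n}".encode() for n in range(64)]
left, right = split_node(entries, 16, 48)
```

Because the decision depends only on the content of each entry, inserting an unrelated entry ahead of the search region shifts the split index by one but leaves the split on the same entry, so the resulting right-hand node can still match a previously stored one.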

Description

Technical field

[0001] The present invention relates generally to node splitting in data structures, and more particularly to content-defined node splitting in data structures.

Background

[0002] In traditional backup systems, large amounts (e.g., terabytes) of input data must be indexed and stored. A data structure such as a tree structure is used to store metadata (e.g., nodes, indexes of underlying data, etc.) related to data (e.g., directories, files, data sequences, data chunks, etc.). In backup systems for large file systems, these data structures arrange chunks of file data of uniform or varying sizes in an ordered sequence. That is, the underlying file data is a sequence of byte chunks from the input stream with associated file offsets, and the metadata tree arranges the addresses of the chunks into an ordered sequence. In this way, the location of the underlying data, and as such, the auxiliary file and directory related information is persisted to enable retri...


Application Information

Patent Timeline
Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F17/30067, G06F17/30091, G06F16/13
Inventor: E. Kruus, C. Ungureanu, S. Gokhale, A. Aranya, S. A. Rago
Owner: NEC LAB AMERICA