Unlock instant, AI-driven research and patent intelligence for your innovation.

Apparatus, system, and method for improved data deduplication

A technology for data strips and data segments, which is applied in the field of hash generation and can solve problems such as unavailability of storage device 120

Active Publication Date: 2012-07-18
SANDISK TECH LLC
View PDF11 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Furthermore, asynchronous deduplication may render storage device 120 unusable while the file is being read, even when more urgent processes require access to storage device 120

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Apparatus, system, and method for improved data deduplication
  • Apparatus, system, and method for improved data deduplication
  • Apparatus, system, and method for improved data deduplication

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Many functional units described in this specification have been marked as modules in order to emphasize the independence of their implementation. For example, modules can be implemented as hardware circuits including custom VLSI circuits or gate arrays, finished semiconductors such as logic chips, transistors, or other discrete components. Modules can also be implemented in the form of programmable hardware devices, such as field programmable gate arrays, programmable array logic, programmable logic devices, etc.

[0037] The modules can also be implemented as software stored on a computer-readable storage medium to be executed by various types of processors. In certain embodiments, the modules can also be implemented in the form of firmware. For example, an equivalent module of executable code may include one or more physical or logical computer instruction blocks stored on a computer-readable storage medium, wherein the one or more physical or logical computer instructi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An apparatus, system, and method are disclosed for improved deduplication. An input module 312, hash module 314, and transmission module 316 are implemented in a nonvolatile storage device 210. The input module 312 receives hash requests 302 from requesting entities internal or external to the nonvolatile storage device 210. Hash requests include a data unit identifier that identifies the data unit for which the hash is requested. The hash module 314 generates a hash for the data unit using a hash function. The hash is generated using computing resources of the nonvolatile storage device 210. The transmission module 316 sends the hash to a receiving entity when the input module 312 receives the hash request. A deduplication agent 110 uses the hash to determine whether or not the data unit is a duplicate of a data unit already stored in the storage system that includes the nonvolatile storage device 210.

Description

Technical field [0001] The present invention relates to data deduplication. Specifically, the present invention relates to the timing of deduplication operations and the generation of hashes for such operations. Background technique [0002] Data deduplication operations generally involve the removal of redundant data in the storage system. Data deduplication can provide considerable benefits in any system, but it is especially valuable in large enterprise storage systems. For example, if a large email is sent to multiple individuals in a company in the form of an email attachment, storing a copy of the large file for everyone receiving the email is a waste of storage space. It is best to store a single copy of the file and have the pointer direct all recipients to that single copy. For users, removing redundant data from a system (whether the system is a single drive, storage area network ("SAN"), network attached storage ("NAS"), or other storage system) provides many benefi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/24G06F15/16G06F13/14G06F9/06
CPCG06F3/0641G06F3/0679G06F3/0689G06F2212/214G06F3/0608G06F12/0866
Inventor 乔纳森·希尔达大卫·弗林约翰·斯特拉瑟
Owner SANDISK TECH LLC