Method, device and system for searching repeated data

A technology for duplicating data and data, applied in the storage field, can solve the problems of reducing system performance and occupying a large link overhead, and achieve the effect of improving system performance and reducing link overhead

Inactive Publication Date: 2012-06-13
HUAWEI DIGITAL TECH (CHENGDU) CO LTD
View PDF3 Cites 49 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The inventor found in the research that in the prior art, in the process of searching for duplicate data blocks, when the subdivision block data is not stored in the database, the subdivision block data needs to be sent to the node that manages the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and system for searching repeated data
  • Method, device and system for searching repeated data
  • Method, device and system for searching repeated data

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0025] In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0026] figure 1 For an embodiment of the method for repeating data search according to the present invention, refer to figure 1 , the method of this embodiment may include:

[0027] Step 100, the file is divided into blocks, and the metadata information of each block data is generated, wherein the metadata information of the block...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention provides a method, a device and a system for searching repeated data. After a repeated-deleting file is partitioned into blocks, fingerprint information of block data is utilized to determine a metadata server responsible for search. When the fingerprint information of the block data is found to be stored in a database by search, the block data corresponding to the fingerprint information of the block data are transmitted to a shared-file system and are not required to be transmitted to the metadata server for storage. Compared with the prior art, the method, the device and the system for searching the repeated data reduce link cost to a large extent and improve system performance.

Description

technical field [0001] Embodiments of the present invention relate to storage technologies, and in particular, to methods, devices, and systems for searching for duplicate data. Background technique [0002] Data deduplication (hereafter referred to as "deduplication"), also known as intelligent compression or single instance storage, is a method of automatically searching for duplicate data, keeping only one copy of the same data, and replacing others with pointers to the single copy Duplicate copies to achieve a storage technology that eliminates redundant data and reduces storage capacity requirements. [0003] In data deduplication, the search for duplicate data is undoubtedly an important indicator of deduplication performance. In the prior art, in order to improve the search efficiency of duplicate data, the following methods are used: [0004] Divide the file to be deduplicated based on the content to obtain the first block, and then subdivide the first block to obt...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F17/30159G06F17/30G06F16/1752
Inventor 黄焰谢勇
Owner HUAWEI DIGITAL TECH (CHENGDU) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products