Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A deduplication method, device, equipment and readable storage medium

A technology for deduplication and address storage, applied in the field of all-flash storage, can solve the problems of inability to HASH data cache, reduce the overall performance of the storage system, etc., achieve the effect of high overall performance, improve deduplication performance, and avoid excessive occupation

Active Publication Date: 2021-10-15
SUZHOU METABRAIN INTELLIGENT TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This can effectively reduce the writing of duplicate data to the disk and greatly improve the effective utilization of the storage system. However, frequent HASH reading of the disk will greatly increase the delay of writing data and reduce the overall performance of the storage system.
To solve the above problems, there is a way to cache all HASH data, but the current memory size cannot support enough HASH data for all caches

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A deduplication method, device, equipment and readable storage medium
  • A deduplication method, device, equipment and readable storage medium
  • A deduplication method, device, equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The core of the present invention is to provide a deduplication method, which can reduce unnecessary disk access operations and unnecessary memory overhead.

[0051]In order to enable those skilled in the art to better understand the solution of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. Apparently, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052] Please refer to figure 1 , figure 1 It is a flowchart of a deduplication method in an embodiment of the present invention, and the method includes the following steps:

[0053] S101. After receiving the IO write request, determine the IO ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a deduplication method, which converts the comparison of deduplication data into a two-level comparison, and stores a part of the eigenvalues ​​written in the IO, that is, the first part of the eigenvalues ​​in the cache, and the written The other part of the eigenvalues ​​of the IO, that is, the second part of the eigenvalues ​​is stored in the disk to avoid excessive occupation of the cache space; when comparing, the first part of the eigenvalues ​​is first compared, and only the first part of the eigenvalues ​​is hit Only when the first part of the eigenvalue comparison is not hit, then the second part of the eigenvalue comparison is discarded and the data is written directly. This two-level comparison The deduplication performance can be improved by caching pre-screening, and unnecessary memory overhead can be reduced by caching and post-allocation. The overall performance of the system is high. The invention also discloses a deduplication device, equipment and a readable storage medium, which have corresponding technical effects.

Description

technical field [0001] The present invention relates to the technical field of all-flash storage, in particular to a deduplication method, device, equipment and readable storage medium. Background technique [0002] At present, the all-flash storage system has gradually become the mainstream storage device of major operators and financial institutions. In order to increase the total data storage capacity under the premise of constant capacity, major manufacturers have supported the deduplication (deduplication) feature. Deduplication is a technology to save storage space. Usually, there are many duplicate data in the data storage pool. Deduplication is a technology to find and process these duplicate data. Simply put, deduplication is to delete Only one copy of N duplicate data is kept, and the address pointer of N-1 data points to the only one. Deduplication saves customer costs, but introduces a lot of overhead that does not exist in traditional storage for the system, su...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F3/06
CPCG06F3/061G06F3/0641G06F3/0656G06F3/0679
Inventor 甄凤远刘志勇
Owner SUZHOU METABRAIN INTELLIGENT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products