Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data block construction and comparison method and device, medium and equipment

A construction method and data block technology, applied in the field of data block construction and comparison, can solve the problems of high time cost and space cost, and large time consumption

Pending Publication Date: 2021-04-16
BEIJING BAISHANCLOUD TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Segmentation of large data blocks and calculation of hash fingerprints will consume a lot of time, and the time and space costs for comparison are very high. For general enterprises, such costs are almost unacceptable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data block construction and comparison method and device, medium and equipment
  • Data block construction and comparison method and device, medium and equipment
  • Data block construction and comparison method and device, medium and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to make the purpose, technical solutions and advantages of the embodiments of this paper clearer, the technical solutions in the embodiments of this paper will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of this paper. Obviously, the described embodiments are the Some, but not all, embodiments. Based on the embodiments herein, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts fall within the scope of protection herein. It should be noted that, in the case of no conflict, the embodiments herein and the features in the embodiments can be combined arbitrarily with each other.

[0050] figure 1 It is a flow chart of a data block construction method according to an exemplary embodiment. refer to figure 1 , the data block construction methods include:

[0051] Step S11, determine N sub-data blocks according to the comparison task, and fill the N sub...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a data block construction and comparison method and device, a medium and equipment. The method comprises the steps of determining N sub-data blocks according to a comparison task, and filling the data blocks with the N sub-data blocks; generating N hash fingerprints in one-to-one correspondence with the contents of the N sub-data blocks; and adding the N hash fingerprints into the data blocks. When data block similarity comparison is carried out, hash fingerprints or hash fingerprint lists in a plurality of to-be-compared data blocks are directly extracted, and similarity coefficients of the plurality of data blocks are determined based on the hash fingerprints or hash fingerprint lists, so that the process of segmenting big data and calculating the hash fingerprints is avoided, the calculation time is saved, and the efficiency is improved.

Description

technical field [0001] This article relates to distributed storage, in particular to a data block construction and comparison method, device, medium and equipment. Background technique [0002] In related storage technologies, data blocks (Oracle Data Blocks) are the smallest storage units, and data is stored in "data blocks", and a data block occupies a certain amount of disk space. [0003] In the process of using data block storage, there is usually a scenario of comparing whether the contents of two data blocks are highly similar. In order to compare the similarity of two data blocks, it is generally used in the prior art: the data block is cut into blocks in a certain way, the hash fingerprint is calculated for each small block of data, and then the limited sample set (that is, the data block) is compared using the similarity coefficient. The similarity and difference between the hash fingerprint set of the small data block corresponding to the block), the larger the c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/06G06F16/22
Inventor 李文博吴义谱
Owner BEIJING BAISHANCLOUD TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products