Data difference analysis method based on sliding window

A data difference and analysis method technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problem of low accuracy and achieve the effect of reducing data redundancy

Inactive Publication Date: 2009-09-30
北京兴宇中科科技开发股份有限公司
View PDF0 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In the traditional method, the original data and the target data are usually divided into data blocks of equal size, and then the data blocks with the same content are searched in the original data and the target data. The accuracy of this method is low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data difference analysis method based on sliding window
  • Data difference analysis method based on sliding window
  • Data difference analysis method based on sliding window

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be further described with examples below in conjunction with the accompanying drawings.

[0040] Such as figure 1 as shown, figure 1 It is a flow chart of calculating data differences, which illustrates the steps of calculating data differences:

[0041] 1) Divide the original data into data sub-blocks of equal size;

[0042] 2) and calculate the hash value of each data sub-block in the original data respectively;

[0043] 3) Set the current processing position equal to the start position of the target data;

[0044] 4) If the size of the remaining data is less than the size of the data sub-block of the original data, turn to 10);

[0045] 5) Get a data block whose size is equal to the data sub-block size of the original data from the current processing position as the data window;

[0046] 6) Determine the matching range between the original data and the target data according to the data window.

[0047] 7) If no matching range is found...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of data compression, more particularly relates to a method for carrying out data difference analysis by adopting a sliding window. In a plurality of applications of a computer, if differences among different data can be analyzed, great help for reducing data redundancy and improving the processing efficiency of the computer can be brought. The method adopted by the invention comprises the steps of dividing original data into equivalent data blocks and calculating the hash value of each data block; adopting the method of the sliding window in the original data to position whether the data block which is equivalent to sliding window data exists or not; further positioning the matching range around the equivalent data block if the data block which is equivalent to the sliding window data exists and recording the matching situation into differential data; moving the sliding window and then continuously comparing if the data block which is equivalent to the sliding window data does not exist; and repeating the operations till the data is over. By adopting the method of the invention, the difference between the original data and the target data can be analyzed rapidly.

Description

technical field [0001] The invention belongs to the field of data compression, and in particular relates to a method for analyzing data differences by using a sliding window. Background technique [0002] In computer systems, there are often large amounts of data that differ only slightly from each other during communication and storage. For example, a user may modify a document multiple times and save it as different files multiple times during the modification process. [0003] The differences between these different files are very small, but the computer system must keep a copy of each file, which wastes a lot of storage space. If such files are transmitted on the network, data with very small differences are transmitted each time on the network, which also wastes network bandwidth. [0004] If we can separate the difference between different data, and only process the difference, this will greatly improve the processing efficiency of the computer. For example, for a s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 林兆祥
Owner 北京兴宇中科科技开发股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products