Compression method for repeated data

A method for compressing repeated data, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. Its effect is to shorten the compression time.

Status: Inactive
Publication Date: 2008-12-10
Owner: EISOO SOFTWARE
0 Cites, 26 Cited by

AI Technical Summary

Problems solved by technology

However, the disadvantage of this technology is that different types of files have completely different storage structures. For example, a text file stores its original content directly, a file generated by word processor software uses an object-based semi-structured format, and the database files generated by a database system are saved in a block-like structured way. If the data to be processed is segmented at a fixed length regardless of these differences in storage structure, the recognition rate when judging whether a segmented data block is repeated data is often low, which leads to an unsatisfactory compression rate for repeated data.
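The weakness of fixed-length segmentation can be illustrated with a minimal sketch. The helper names (`fixed_blocks`, `duplicate_ratio`), the 64-byte block size, the SHA-256 fingerprints, and the sample data are all assumptions made for illustration; they do not come from the patent text.

```python
import hashlib

def fixed_blocks(data: bytes, size: int) -> list[bytes]:
    """Split data into consecutive blocks of `size` bytes (the last may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]

def duplicate_ratio(old: bytes, new: bytes, size: int) -> float:
    """Fraction of blocks in `new` whose hash already appears among the blocks of `old`."""
    seen = {hashlib.sha256(b).digest() for b in fixed_blocks(old, size)}
    blocks = fixed_blocks(new, size)
    hits = sum(hashlib.sha256(b).digest() in seen for b in blocks)
    return hits / len(blocks) if blocks else 0.0

old = bytes(range(256)) * 16   # 4096 bytes of sample content (illustrative only)
appended = old + b"tail"       # change at the end: earlier block boundaries are preserved
inserted = b"X" + old          # one byte inserted at the front: every boundary shifts

ratio_appended = duplicate_ratio(old, appended, 64)
ratio_inserted = duplicate_ratio(old, inserted, 64)
print(ratio_appended, ratio_inserted)
```

In this sketch, appending data leaves almost every block recognizable, while a single inserted byte shifts every block boundary so that no block matches. This is one way a segmentation that ignores the file's structure can lose recognition rate.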

Embodiment Construction

[0016] The present invention will be further described below in conjunction with the accompanying drawings.

[0017] Figure 1 is a flow chart for comparing the differences between same-named files of a given type, referred to as the flow chart of the comparison program. The steps shown in the figure compare how same-named files of a given type change, that is, which parts have changed and which parts have not, and express these change rules as a data block change table. At the same time, a storage area is designated in the memory of the target computer for saving the compared files and the corresponding data block change tables.
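A data block change table of this kind can be sketched roughly as follows. The patent text available here does not say how the comparison is performed, so this example substitutes Python's standard `difflib.SequenceMatcher` as a stand-in; the function name `block_change_table` and the field names `status`, `old_range`, and `new_range` are hypothetical.

```python
from difflib import SequenceMatcher

def block_change_table(old: bytes, new: bytes) -> list[dict]:
    """Describe `new` relative to `old` as a list of changed and unchanged byte ranges."""
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    table = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        table.append({
            # "equal" spans are unchanged; replace/insert/delete spans are changed
            "status": "unchanged" if tag == "equal" else "changed",
            "old_range": (i1, i2),   # half-open byte range in the old version
            "new_range": (j1, j2),   # half-open byte range in the new version
        })
    return table

# Two versions of a same-named file (illustrative content only)
old_version = b"HEADER|alpha|beta|FOOTER"
new_version = b"HEADER|alpha|gamma|FOOTER"
table = block_change_table(old_version, new_version)
for row in table:
    print(row)
```

Each row records whether a span changed and where it sits in both versions, which is the kind of information a later analysis step could aggregate across many files of the same type.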

[0018] The specific steps of the comparison program shown in Figure 1 are as follows:

[0019] For a file to be compared, first obtain its file type; the file type can be judged by the file extension or the file control information in the fi...
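The file type lookup might be sketched as below. The extension branch follows the text directly. The second branch is an assumption: the source sentence is cut off, so checking leading signature bytes is only one plausible reading of "file control information". The function name `detect_file_type` and the `SIGNATURES` table are hypothetical; the two signatures listed are the well-known leading bytes of PDF and ZIP files.

```python
from pathlib import Path

# Hypothetical signature table with two well-known leading byte sequences.
SIGNATURES = {b"%PDF-": "pdf", b"PK\x03\x04": "zip"}

def detect_file_type(name: str, head: bytes = b"") -> str:
    """Return a file type label from the extension, else from leading bytes, else 'unknown'."""
    ext = Path(name).suffix.lower().lstrip(".")
    if ext:
        return ext
    # Assumed fallback: inspect the first bytes of the file content
    for magic, kind in SIGNATURES.items():
        if head.startswith(magic):
            return kind
    return "unknown"

print(detect_file_type("Report.DOC"))
print(detect_file_type("backup", b"%PDF-1.4"))
```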

Abstract

The present invention discloses a method for compressing duplicated data in a computer, which can greatly increase the compression rate of duplicated data in a short time. The invention is carried out as follows: first, a comparison program compares the similarities and differences between changed versions of same-named files of the same type and obtains a corresponding data block change table; then, an analysis program analyzes all data block change tables for files of the same type, derives the optimal splitting mode for that file type, and stores it in a type splitting database; finally, when the duplicated data of a certain file type needs compressing, the optimal splitting mode stored in the type splitting database is used to compress the duplicated data in the file to be processed, thus achieving the best compression rate.
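The three stages can be sketched end to end as follows. The abstract does not state how the analysis program scores a splitting mode, so this example assumes a simple criterion: among candidate fixed block sizes, prefer the one that leaves the most bytes in untouched blocks, minus a per-block index cost. The change table layout, the `overhead` value, the candidate sizes, and all function names are hypothetical, and an in-memory dictionary stands in for the type splitting database.

```python
import hashlib

# Hypothetical change table: (file_length, [(start, end), ...]); each half-open
# byte range marks a region that changed between two versions of a file.
ChangeTable = tuple[int, list[tuple[int, int]]]

def reusable_bytes(table: ChangeTable, size: int) -> int:
    """Bytes covered by fixed blocks of `size` that overlap no changed range."""
    length, changed = table
    total = 0
    for start in range(0, length, size):
        end = min(start + size, length)
        if not any(c0 < end and start < c1 for c0, c1 in changed):
            total += end - start
    return total

def best_block_size(tables: list[ChangeTable], candidates: list[int],
                    overhead: int = 32) -> int:
    """Pick the candidate maximizing reusable bytes minus an assumed per-block index cost."""
    def score(size: int) -> int:
        blocks = sum(-(-length // size) for length, _ in tables)  # ceiling division
        return sum(reusable_bytes(t, size) for t in tables) - overhead * blocks
    return max(candidates, key=score)

# In-memory stand-in for the type splitting database: file type -> block size
type_splitting_db: dict[str, int] = {}

def deduplicate(data: bytes, file_type: str, store: dict[str, bytes]) -> list[str]:
    """Split `data` with the stored block size, keep each distinct block once, return the recipe."""
    size = type_splitting_db.get(file_type, 4096)  # 4096 is an assumed default
    recipe = []
    for i in range(0, len(data), size):
        block = data[i:i + size]
        key = hashlib.sha256(block).hexdigest()
        store.setdefault(key, block)
        recipe.append(key)
    return recipe

# Stage 2: analyze the change tables collected for one file type
doc_tables = [(4096, [(100, 120)]), (4096, [(2000, 2050)])]
type_splitting_db["doc"] = best_block_size(doc_tables, [64, 512, 4096])

# Stage 3: deduplicate a file of that type and check that it can be restored
store: dict[str, bytes] = {}
recipe = deduplicate(bytes(1024), "doc", store)
restored = b"".join(store[k] for k in recipe)
print(type_splitting_db["doc"], len(recipe), len(store))
```

With these sample tables the sketch selects the middle candidate: very small blocks pay too much index cost, and one block per file is invalidated by any change. A real splitting mode could be richer than a single block size, for example structure-aware boundaries per file type.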

Description

Technical field

[0001] The invention relates to a method for compressing repeated data in a computer, in particular to a method for improving the compression rate of repeated data in computer data storage, archiving and backup.

Background technique

[0002] At present, as the degree of informatization in our country improves, more and more enterprises, institutions and organizations set up their own computer local area networks so that their staff can better share information and work together. However, in a networked office environment, a piece of electronic data with the same content is often saved on multiple computers under the same or different file names or file forms (such as emails, work documents, etc.), and the data of all clients in a local area network is usually regularly archived, stored or backed up centrally on the server. This can result in a large amount of completely duplicated data during archiving, sto...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 贺鸿富
Owner: EISOO SOFTWARE