Data compression storage method and data compression storage apparatus

A data compression and compression storage technology, applied in the field of data processing, can solve the problems of not fully considering the characteristics of enterprise data, and the data compression efficiency is not very high, so as to achieve the effect of improving data compression efficiency and data compression rate

Pending Publication Date: 2018-07-20
CHINA UNIONPAY
View PDF5 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, as mentioned above, the compression tools currently used by enterprises for data storage are common tools for all data. For enterprises, the characteristics of enterprise data are not fully considered.
Therefore, the data compression efficiency is not very high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data compression storage method and data compression storage apparatus
  • Data compression storage method and data compression storage apparatus

Examples

Experimental program
Comparison scheme
Effect test

no. 1 approach

[0060] The first implementation mode relates to an implementation mode in which an optimized compression strategy is used for each field for compressed storage.

[0061] First, the data compression storage method of the first embodiment will be described.

[0062] The data compression storage method of the first embodiment includes a segmentation step and a compression step, wherein the segmentation step is the same as the above-mentioned segmentation step S100, and the compression step specifically includes the following sub-steps:

[0063] Perform content analysis on data cut into multiple fields, and establish associations between fields; and

[0064] For a single field, different compression strategies are used for compressed storage for different data contents.

[0065] For example, as a compression strategy, use relevant and optimized data compression storage methods for different data contents; for example, enumeration type uses binary storage, string value conversion ...

no. 2 approach

[0073] The second embodiment relates to an embodiment using an optimized compression strategy for multiple fields.

[0074] First, the data compression storage method of the second embodiment will be described.

[0075] The data compression storage method of the second embodiment includes a segmentation step and a compression step, wherein the segmentation step is the same as the above segmentation step S100, and the compression step specifically includes the following sub-steps:

[0076] Perform content analysis on data cut into multiple fields, establish data distribution diagrams and correlation diagrams between fields, and identify correlations between data fields based on data distribution diagrams and correlation diagrams; and

[0077] Combining multiple fields that have correlations, for the combined fields, use different compression strategies for different data content to compress and store.

[0078] For example, as a compression strategy, multiple fields are combine...

no. 3 approach

[0086] The third embodiment relates to an embodiment in which an optimized compression strategy is used for both a single field and a combination of multiple fields.

[0087] First, the data compression storage method of the third embodiment will be described.

[0088] The data compression storage method of the third embodiment includes a segmentation step and a compression step, wherein the segmentation step is the same as the above segmentation step S100, and the compression step specifically includes the following sub-steps:

[0089] Perform content analysis on data cut into multiple fields, establish data distribution diagrams and correlation diagrams between fields, and identify correlations between data fields based on data distribution diagrams and correlation diagrams; and

[0090] For a single field, different compression strategies are used for compression storage for different data contents, and multiple fields with correlations are also combined, and for the combin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a data compression storage method and a data compression storage apparatus. The data compression method comprises the following steps: a segmentation step of segmenting original data into multiple fields; and a compression step of compressing different fields by adopting different compression strategies based on different data contents and storing the compressed data. According to the data compression storage method and the data compression storage apparatus, different compression methods are adopted for different data contents, so that the data compression efficiencyis effectively improved; and compared with universal data compression tools such as GZIP, SNAPPY and the like, the data compression rate is remarkably improved.

Description

technical field [0001] The invention relates to data processing technology, in particular to a data compression storage method and a data compression storage device. Background technique [0002] When enterprises store data, they generally compress and store data in order to save storage space and improve reading efficiency. However, general-purpose compression tools target all data. [0003] Furthermore, existing common data compression tools include GZIP, SNAPPY, etc., which are designed to compress general data. [0004] However, as mentioned above, the compression tools currently used by enterprises for data storage are common tools for all data. For enterprises, the characteristics of enterprise data are not fully considered. Therefore, the data compression efficiency is not very high. Contents of the invention [0005] In view of the above problems, the present invention aims to propose a further data compression storage method and a data compression storage devic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/1744G06F40/289
Inventor 何东杰
Owner CHINA UNIONPAY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products