Large-scale industrial data compression storage method, system and medium

A technology for compressed storage of industrial data, applied in file systems, file system management, electrical digital data processing, and related fields. It addresses problems such as wasted disk space, reduces working time, ensures consistency, and improves work efficiency.
CN112214453B · Active · Publication Date: 2021-10-01 · 上海微亿智造科技有限公司 +1

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
上海微亿智造科技有限公司
Publication Date
2021-10-01

Abstract

The invention provides a large-scale industrial data compression storage method, system and medium, including: Step 1: configuring different data acquisition systems according to the type of data source, and extracting the data collected by each data acquisition system through interface operations; Step 2: defining a conversion chain, and temporarily converting the extracted data of different types into Avro format through data cleaning plug-ins; Step 3: compressing the Avro-format data under the GPL protocol, with snappy as the compression format, and creating in the distributed file system a data set that uses Parquet as its storage format, in which the compressed data is stored. The invention can define conversion chains and compression and storage formats for any type of data, greatly improving the data processing speed and data compression ratio of the computing platform.
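
The three steps of the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it uses only the Python standard library, so a plain field-to-type mapping stands in for an Avro schema, zlib stands in for snappy, and an in-memory byte string stands in for a Parquet data set in a distributed file system. The field names, the two cleaning steps and the sample records are all hypothetical.

```python
import json
import zlib
from typing import Callable, Dict, List

Record = Dict[str, object]
Step = Callable[[Record], Record]

# Hypothetical target schema standing in for an Avro schema: field name -> type.
TARGET_SCHEMA = {"device_id": str, "timestamp": int, "value": float}


def rename_fields(record: Record) -> Record:
    """Cleaning step: map source-specific field names onto the target names."""
    aliases = {"dev": "device_id", "ts": "timestamp", "val": "value"}
    return {aliases.get(key, key): item for key, item in record.items()}


def coerce_types(record: Record) -> Record:
    """Cleaning step: keep only schema fields and cast each to its declared type."""
    return {name: cast(record[name]) for name, cast in TARGET_SCHEMA.items()}


def run_chain(records: List[Record], chain: List[Step]) -> List[Record]:
    """Step 2: pass every extracted record through the conversion chain in order."""
    converted = []
    for record in records:
        for step in chain:
            record = step(record)
        converted.append(record)
    return converted


def compress_records(records: List[Record]) -> bytes:
    """Step 3 stand-in: serialize as JSON lines and compress with zlib (not snappy)."""
    payload = "\n".join(json.dumps(r, sort_keys=True) for r in records)
    return zlib.compress(payload.encode("utf-8"))


def decompress_records(blob: bytes) -> List[Record]:
    """Inverse of compress_records, used to check the round trip."""
    text = zlib.decompress(blob).decode("utf-8")
    return [json.loads(line) for line in text.splitlines()]


# Step 1 stand-in: records as two differently shaped sources might deliver them.
extracted = [
    {"dev": "plc-01", "ts": "1601510400", "val": "3.5"},
    {"device_id": "cam-07", "timestamp": 1601510401, "value": 12, "extra": "x"},
]
normalized = run_chain(extracted, [rename_fields, coerce_types])
blob = compress_records(normalized)
```

Modelling the conversion chain as an ordered list of small functions means a new data source only needs its own cleaning steps, while the compression and storage stage stays unchanged.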

Description

Technical Field

[0001] The present invention relates to the technical field of data compression storage, in particular to a large-scale industrial data compression storage method, system and medium.

Background Art

[0002] With the vigorous development of new infrastructure, more and more traditional industrial enterprises have begun to use Internet technology to improve productivity, and data is the most critical element in this effort. In traditional Internet big data processing, the volume of data keeps growing, and many companies keep two backup copies of their data, which wastes disk space.

[0003] Patent document CN108304472A (application number: 201711455790.2) discloses a data compression storage method and a data compression storage device. The data compression method includes the following steps: a segmentation step, which divides the original data into multiple fields; and a compression step, in which, depending on the data content, different compression strategies...

Claims
