Data classification storage method, device and system

A data classification (hierarchical) storage technology, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. It addresses problems such as wasted system resources, sacrificed hardware efficiency, and large-scale data migration, and aims to achieve more accurate hierarchical storage, improved system performance, and reduced workload.

Active Publication Date: 2013-07-03
HANDAN BRANCH OF CHINA MOBILE GRP HEBEI COMPANY LIMITED

AI Technical Summary

Problems solved by technology

[0014] (1) Hardware efficiency is sacrificed and data redundancy is introduced;
[0015] (2) System complexity increases, large-scale data migration is required, and resources across the whole system are wasted;
[0016] (3) Decisions are made entirely on business experience and subjective judgment, and there is no objective basis for calculating the results of the hierarchical storage implementation.

Embodiment Construction

[0058] Data popularity (also rendered as data heat) is the frequency with which data is depended on and used in Extraction Transformation Loading (ETL) production. It is used to evaluate how active the data in the database is.
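As a rough illustration of the definition in [0058], the sketch below counts how often each table is relied on by scheduled ETL jobs. The schedule layout (job name, source tables, runs per day) and the function name `usage_frequency` are assumptions made for this example; the source does not describe the structure of the ETL schedule table.

```python
from collections import Counter


def usage_frequency(schedule):
    """Count how often each table is read by scheduled ETL jobs.

    `schedule` is a hypothetical iterable of
    (job_name, source_tables, runs_per_day) tuples.
    """
    freq = Counter()
    for _job, source_tables, runs_per_day in schedule:
        # A job that lists the same table twice still counts it once per run.
        for table in set(source_tables):
            freq[table] += runs_per_day
    return freq


# Hypothetical schedule: one hourly job and one daily job.
schedule = [
    ("load_sales", ["orders", "customers"], 24),
    ("daily_report", ["orders"], 1),
]
freq = usage_frequency(schedule)
```

Here `orders` is used by both jobs, so its count is the sum of their daily runs, while a table that no job reads has a count of zero.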

[0059] After a data warehouse system is expanded, the same computing cluster contains different types of storage with different input/output (I/O) performance. The I/O performance of the disk array strongly constrains the overall performance of an On-Line Analytical Processing (OLAP) database system. Unlike in an On-Line Transaction Processing (OLTP) system, indicators such as I/O throughput, bandwidth, the number of high-speed disks, and the disk array's I/O ports are very important in an OLAP system, and can even determine the processing capacity of the entire data warehouse.

[0060] For a data warehouse system with a massively parallel processing (MPP) architecture, following the barrel principle (capacity is limited by the shortest stave), the performance s...

Abstract

The invention discloses a data classification storage method, device and system. The method comprises the steps of: acquiring the usage frequency of a data table from an ETL (Extraction Transformation Loading) schedule table, and acquiring the appearance frequency of the data table from a buffer pool; calculating the heat of the data table from its usage frequency and appearance frequency; and storing the data table in a storage class according to its heat level. Because the heat is calculated from both the usage frequency in the ETL schedule table and the appearance frequency in the buffer pool, the evaluation of the data table is more accurate, the classification storage better matches the table's actual condition, and system performance can be improved. Unlike the traditional method of partitioning data by time slice, data in the same data table can be stored on multiple types of storage, with data placed on storage of different performance according to differences in heat. This reduces the workload of migrating large amounts of data every day and improves utilization of the system's performance.
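The steps in the abstract can be sketched as follows. This is a minimal illustration, not the patented calculation: the source does not give the heat formula in this excerpt, so the weighted sum of normalized counts, the weights, the thresholds, and the tier names are all assumptions chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TableStats:
    name: str
    etl_usage: int    # usage frequency taken from the ETL schedule table
    buffer_hits: int  # appearance frequency observed in the buffer pool


def heat_scores(stats, w_usage=0.5, w_buffer=0.5):
    """Return {table name: heat in [0, 1]}.

    Assumed formula: weighted sum of each count divided by the
    largest count seen across all tables.
    """
    max_usage = max((s.etl_usage for s in stats), default=0) or 1
    max_hits = max((s.buffer_hits for s in stats), default=0) or 1
    return {
        s.name: w_usage * s.etl_usage / max_usage
        + w_buffer * s.buffer_hits / max_hits
        for s in stats
    }


def assign_tier(heat, hot=0.6, warm=0.3):
    """Map a heat value to a storage tier using assumed thresholds."""
    if heat >= hot:
        return "high-performance"
    if heat >= warm:
        return "mid-range"
    return "low-cost"


def classify(stats):
    """Compute heat for every table and pick a storage tier for each."""
    return {name: assign_tier(h) for name, h in heat_scores(stats).items()}


# Hypothetical statistics for three tables.
stats = [
    TableStats("orders", etl_usage=100, buffer_hits=80),
    TableStats("customers", etl_usage=50, buffer_hits=40),
    TableStats("audit_log", etl_usage=10, buffer_hits=5),
]
tiers = classify(stats)
```

With these sample numbers, `orders` lands on the high-performance tier, `customers` on the mid-range tier, and `audit_log` on the low-cost tier. In practice the weights and thresholds would need to be tuned against the actual storage classes available in the cluster.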

Description

Technical field

[0001] The present invention relates to business support technology, and in particular to a data hierarchical storage method, device and system.

Background technique

[0002] Hierarchical storage is a commonly used strategy for efficient storage utilization in today's data warehouse projects. Its main aim is to balance input against output without reducing efficiency, which makes it the most cost-effective storage strategy. Hierarchical storage rests on the idea of high efficiency at low cost and pursues the best ratio of output to input. In a business analysis system, the advantages of introducing hierarchical storage technology are:

[0003] 1. Reduced overall storage cost: infrequently accessed data resides on lower-cost storage devices, which combines the performance advantages of high-performance storage devices with the cost advantages of low-priced storage devices;

[0004] ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 易剑光, 霍绍博, 蒋瑞文, 曹健, 王海通, 王娜, 姚春芬, 岳瑞, 杨洁
Owner: HANDAN BRANCH OF CHINA MOBILE GRP HEBEI COMPANY LIMITED