Method for storing massive time series data on basis of Hadoop technologies

A time-series data, massive technology, applied in the direction of electrical digital data processing, structured data retrieval, special data processing applications, etc., can solve the problems that cannot be used as an enterprise time-series data storage platform, time-series data is not synchronized at all times, and cannot be satisfied. Achieve the effects of improving query efficiency and speed, increasing storage space and memory space, and reducing storage space and memory usage

Active Publication Date: 2017-05-17
SHANDONG LUNENG SOFTWARE TECH
View PDF3 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to breaking the continuity of time series data, it is very inefficient to access the data of a single measuring point or a group of temporarily organized measuring points; if the time series data of the same measuring point is used in multiple calculations or applications, multiple copies need to be stored data
There are many production processes on the production site of the enterprise, and the sources of time series data are relatively scattered. It is difficult to ensure that many control systems collect time series data synchronously. The collected time series data is not synchronized at all times. The requirements of the enterprise production site time series data platform can only be used as time series data storage for special calculations or applications, and cannot be used as a general enterprise time series data storage platform

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for storing massive time series data on basis of Hadoop technologies
  • Method for storing massive time series data on basis of Hadoop technologies
  • Method for storing massive time series data on basis of Hadoop technologies

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The concrete implementation of the present invention is described in detail below, it is necessary to point out here that the following implementation is only used for further description of the present invention, and can not be interpreted as limiting the protection scope of the present invention. Some non-essential improvements and adjustments still belong to the protection scope of the present invention.

[0040] The present invention provides a kind of massive time-series data storage realization method based on Hadoop technology, specifically comprises the following steps:

[0041] 1), establish HBase primary key storage primary key design scheme model

[0042] a. Create a measuring point information table in HBase, which records all relevant measuring points of the time series database storing time series data in the measuring point information table, but other information except time series data;

[0043] b. The measuring point information in the measuring point...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for storing massive time series data on the basis of Hadoop technologies. The method includes steps of building Hadoop primary key storage and primary key design scheme models; building HBase data row storage structural design scheme models; accessing the time series data and the like. The method has the advantages that the method is low in implementation cost, storage spaces and memory occupation required for storing the data can be reduced to a great extent, and the storage efficiency can be improved.

Description

technical field [0001] The invention relates to the field of centralized storage and calculation processing of massive continuous process parameter data generated by the production site of a large-scale production process automation industrial enterprise / group, belongs to the technical field of time series data storage, and specifically relates to a massive time series data storage based on Hadoop technology Implementation. Background technique [0002] Electricity, petroleum, chemical industry, new energy and other asset-heavy production process automation industries are the main production sectors related to the national economy and people's livelihood, and their main production processes need to be carried out continuously. Due to the complexity of the production process, there are many process parameters (time-series data) that need to be controlled and measured in the production process, and the amount of time-series data that needs to be recorded is very large, and the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2282G06F16/27
Inventor 李海斌丁书耕李秀芬张华伟潘爱兵陈勇
Owner SHANDONG LUNENG SOFTWARE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products