A columnar storage and query method and system for time series data

This columnar storage technique for time-series data belongs to the fields of electrical digital data processing and special data processing applications. It addresses the lack of time-series optimization in existing columnar formats, which cannot store data values and timestamps together in one column and cannot index both within one column, and it achieves reduced disk I/O and faster queries.

Active Publication Date: 2018-12-11
TSINGHUA UNIV
Cites: 5 | Cited by: 0

AI Technical Summary

Problems solved by technology

[0004] Mainstream columnar storage file formats such as Parquet and ORCFile are not currently optimized for storing time-series data. When storing time-series data in columnar form, they use a general-purpose columnar storage method with a complex nested structure. The data values and timestamps of a group of time-series data cannot be stored together in one column, and the timestamps and data values cannot both be indexed within one column of data. These problems slow down time-series data queries and consume more system resources.

Examples

Detailed Description of the Embodiments

[0042] Specific implementations of the present invention are described in further detail below with reference to the accompanying drawings and embodiments. The following examples illustrate the present invention and are not intended to limit its scope.

[0043] Referring to Figure 1, an embodiment of the present invention provides a columnar storage method for time-series data that stores the timestamps and data values of all data points of a time series together in one column, improving the query efficiency of time-series data. The method includes: dividing a column of time-series data into multiple pages, where each page stores a portion of the column's data points and the pages together hold all data points in the column; and setting a page header and a page body for each page. For each page, store the aggregated index information of all d...
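
The page layout described in this paragraph can be sketched roughly as follows. This is a minimal illustration and not the patent's implementation: the names `PageHeader`, `Page`, and `build_pages`, the fixed `page_size`, and the specific header fields (count, minimum and maximum timestamp, minimum and maximum value) are all assumptions, since the text only refers to "aggregated index information".

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[int, float]  # (timestamp, value) kept together in one column


@dataclass
class PageHeader:
    # Hypothetical aggregate fields; the source does not list them.
    count: int
    min_time: int
    max_time: int
    min_value: float
    max_value: float


@dataclass
class Page:
    header: PageHeader  # aggregated index over the page's points
    body: List[Point]   # the data points themselves


def build_pages(points: List[Point], page_size: int) -> List[Page]:
    """Split one column of time-series points into consecutive pages."""
    if page_size <= 0:
        raise ValueError("page_size must be positive")
    pages = []
    for start in range(0, len(points), page_size):
        chunk = points[start:start + page_size]
        times = [t for t, _ in chunk]
        values = [v for _, v in chunk]
        header = PageHeader(len(chunk), min(times), max(times),
                            min(values), max(values))
        pages.append(Page(header, chunk))
    return pages


# Ten points split into pages of at most four points each.
series = [(t, float(t % 7)) for t in range(10)]
pages = build_pages(series, page_size=4)
```

Every point lands in exactly one page, so the page bodies together reproduce the whole column, which matches the requirement that the pages jointly hold all data points.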

Abstract

The present invention provides a columnar storage and query method and system for time-series data. The storage method includes: dividing a column of time-series data into multiple pages, where each page stores a portion of the column's data points and the pages together hold all data points in the column; and setting a page header and a page body for each page, storing the aggregated index information of all data points in the page in the page header and the data value information of all data points in the page in the page body. With the present invention, the timestamps and data values of a group of time-series data can be stored in one column, which reduces disk I/O during queries. Dividing the data into pages and building aggregated index information for each page speeds up data queries.
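
One way per-page index information could reduce reads during a query is by skipping pages whose header shows they cannot match. The sketch below is an assumption-laden illustration, not the patented query method: it supposes each header records the minimum and maximum timestamp of its page, and `make_page` and `query_time_range` are hypothetical helper names.

```python
from typing import Dict, List, Tuple

Point = Tuple[int, float]  # (timestamp, value)


def make_page(points: List[Point]) -> Dict:
    """Build a page with an assumed min/max-timestamp header."""
    times = [t for t, _ in points]
    return {
        "header": {"min_time": min(times), "max_time": max(times)},
        "body": points,
    }


def query_time_range(pages: List[Dict], start: int, end: int):
    """Return points with start <= timestamp <= end, plus bodies read."""
    result: List[Point] = []
    bodies_read = 0
    for page in pages:
        header = page["header"]
        # Skip the body when the header proves no point can match.
        if header["max_time"] < start or header["min_time"] > end:
            continue
        bodies_read += 1
        result.extend(p for p in page["body"] if start <= p[0] <= end)
    return result, bodies_read


# Three pages covering timestamps 0-3, 4-7, and 8-11.
pages = [make_page([(t, t * 0.5) for t in range(s, s + 4)])
         for s in (0, 4, 8)]
hits, bodies_read = query_time_range(pages, 5, 6)
```

In this example only the middle page's body is scanned; the other two pages are rejected from their headers alone, which is the kind of saving the abstract attributes to paging plus aggregated indexes.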

Description

Technical Field

[0001] The present invention relates to the technical field of data storage, and more specifically to a columnar storage and query method and system for time-series data.

Background

[0002] With the development of computer technology and industrial informatization, the amount of data generated in the industrial field keeps increasing. Time-series data is widely used in industry and is the main data type in industrial data. As the main data format in industrial big data, its storage and querying have become a key topic in industrial big data research.

[0003] Time-series data is usually generated by machine sensors at a certain frequency and, once generated, almost never needs to be modified. In addition, time-series data is used mostly for analytical queries rather than fine-grained inserts, deletes, and updates. Therefore, combined with the structural characteristi...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30
CPC: G06F16/221, G06F16/2474, G06F16/2477, G06F16/2282, G06F16/248
Inventor: 王建民, 黄向东, 张金瑞, 康荣, 王晨, 龙明盛
Owner: TSINGHUA UNIV