Method of constructing binary electric-power sequential data index based on Geohash

A time-series data index construction method, applied in fields such as electronic digital data processing, structured data retrieval, and special data processing applications. It addresses problems such as reduced query efficiency, failure to retain multivariate time-series information, and uneven data distribution.

Status: Inactive
Publication Date: 2017-10-20
SHANGHAI MUNICIPAL ELECTRIC POWER CO +2
4 Cites, 9 Cited by

AI Technical Summary

Problems solved by technology

[0007] Current methods based on space division retain approximate information about the multivariate time series, but they generally partition the space in a fixed way. When the data are not uniformly distributed, such an index becomes unbalanced, which reduces query efficiency.
Indexes based on feature compression find similar time series through dimensionali




Embodiment

[0032] To address the deficiencies of the traditional methods, the present invention proposes a Geohash-based index construction method for binary power time series data, BTSAX (Multivariate Timeseries Symbolic Aggregate approximation). The method divides the original power time series into several equal-length sub-segments and encodes each sub-segment with a Geohash code. The Geohash codes of all sub-segments are concatenated into a symbol string, which is the BTSAX representation; similar binary time series have the same representation. For massive binary time series data, an index structure based on HBase storage is designed so that similarity queries can be answered quickly. BTSAX can both partition the space dynamically according to the amount of data and retain the original binary time series information at a specified precision.
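The encoding described in [0032] can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented procedure: each sub-segment is reduced to the mean of its two variables (a piecewise-aggregate step that the source does not spell out), and the names `geohash2d`, `btsax`, `x_range`, `y_range`, and `precision` are hypothetical. The bit interleaving and base-32 alphabet follow the standard Geohash scheme, with the two series variables taking the place of longitude and latitude.

```python
from statistics import mean

BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"  # standard Geohash alphabet


def geohash2d(x, y, x_range, y_range, precision=2):
    """Geohash-style code for a 2-D point: bisect each axis in turn
    (x first) and pack every 5 interleaved bits into one base-32 char."""
    x_lo, x_hi = x_range
    y_lo, y_hi = y_range
    bits = []
    for i in range(precision * 5):
        if i % 2 == 0:  # even bit positions refine the x interval
            mid = (x_lo + x_hi) / 2
            if x >= mid:
                bits.append(1)
                x_lo = mid
            else:
                bits.append(0)
                x_hi = mid
        else:  # odd bit positions refine the y interval
            mid = (y_lo + y_hi) / 2
            if y >= mid:
                bits.append(1)
                y_lo = mid
            else:
                bits.append(0)
                y_hi = mid
    chars = []
    for i in range(0, len(bits), 5):
        value = 0
        for b in bits[i:i + 5]:
            value = (value << 1) | b
        chars.append(BASE32[value])
    return "".join(chars)


def btsax(series, n_segments, x_range, y_range, precision=2):
    """Symbol string for a bivariate series given as (x, y) pairs.
    ASSUMPTION: each equal-length segment is summarised by its mean point;
    trailing samples that do not fill a whole segment are ignored."""
    seg_len = len(series) // n_segments
    if seg_len == 0:
        raise ValueError("series is shorter than the number of segments")
    codes = []
    for s in range(n_segments):
        seg = series[s * seg_len:(s + 1) * seg_len]
        mx = mean(p[0] for p in seg)
        my = mean(p[1] for p in seg)
        codes.append(geohash2d(mx, my, x_range, y_range, precision))
    return "".join(codes)


# Hypothetical daily readings: (load in kWh, voltage in V)
a = [(1.0, 220.0), (1.2, 221.0), (3.0, 218.0), (3.1, 219.0)]
b = [(1.1, 220.5), (1.1, 220.5), (3.0, 218.5), (3.1, 218.5)]
code_a = btsax(a, 2, (0.0, 10.0), (200.0, 240.0))
code_b = btsax(b, 2, (0.0, 10.0), (200.0, 240.0))
```

A larger `precision` yields smaller cells, so fewer series share a code; this is one way to read the "specified precision" mentioned in the text.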

[0033] Specifically, the method includes the following steps:

[0034] (1) Generation of binary time series symbolic representation BTSAX

[0035] 1) The present inv...


Abstract

The invention relates to a method of constructing a binary electric-power time series data index based on Geohash. The method comprises the following steps: 1, obtaining the original binary electric-power time series data and performing dimensionality reduction on it; 2, applying Geohash coding to the dimensionality-reduced data to obtain the BTSAX representation of the binary electric-power time series data; 3, constructing the BTSAX data index from the BTSAX representation, and using an HBase database to store both the original binary electric-power time series data and the index. Compared with the prior art, the method has the advantages of dynamic partitioning, specifiable precision, non-overlapping nodes and the like.
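Step 3 can be illustrated with a small in-memory stand-in for a sorted key-value store. HBase keeps rows ordered lexicographically by row key, so series whose BTSAX strings share a prefix sit next to each other and can be read with one range scan. The sketch below only mimics that ordering with a sorted list; it does not use any HBase client, and the class name `PrefixIndex` and the row-key layout `code#series_id` are hypothetical choices, not the index structure claimed in the patent.

```python
import bisect


class PrefixIndex:
    """Toy stand-in for a store with lexicographically sorted row keys."""

    def __init__(self):
        self._keys = []   # row keys, kept in sorted order
        self._rows = {}   # row key -> series id

    def put(self, code, series_id):
        # ASSUMPTION: the row key is the BTSAX code followed by the series id
        key = f"{code}#{series_id}"
        if key not in self._rows:
            bisect.insort(self._keys, key)
        self._rows[key] = series_id

    def scan_prefix(self, prefix):
        """Return the ids of all series whose row key starts with prefix."""
        start = bisect.bisect_left(self._keys, prefix)
        found = []
        for key in self._keys[start:]:
            if not key.startswith(prefix):
                break  # sorted order: no later key can match
            found.append(self._rows[key])
        return found


index = PrefixIndex()
index.put("s0s1", "meter-a")
index.put("s0s1", "meter-b")
index.put("s0t4", "meter-c")
index.put("u2s1", "meter-d")
same_code = index.scan_prefix("s0s1")   # series with an identical representation
same_start = index.scan_prefix("s0")    # series that agree on the leading symbols
```

Scanning with the full code returns the series that share a representation; a shorter prefix widens the match to series that agree only on the leading symbols.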

Description

Technical field

[0001] The invention relates to the field of power data processing, and in particular to a Geohash-based method for constructing an index over binary power time series data.

Background technique

[0002] User electricity load data is a kind of massive time series data, characterized by a large user base, high collection density, and close correlation with a large amount of economic and social data. Time series indexing technology is very important for reducing the time cost of data query and retrieval and for improving the efficiency of time series mining (such as classification, clustering, outlier detection, and pattern discovery). A time series is a sequence of data arranged in chronological order. According to the number of variables it contains, a time series can be classified as univariate or multivariate. The user's electricity load data contains multiple information such as daily electricity consumption, voltage, and curr...


Application Information

IPC(8): G06F17/30
CPC: G06F16/2255, G06F16/2272, G06F16/2474, G06F16/27
Inventor: 周向东, 王飞, 庞悦, 郭乃网, 苏运, 田英杰
Owner: SHANGHAI MUNICIPAL ELECTRIC POWER CO