Geoword-based multivariate power time series data indexing method

A time series data, multivariate technology, applied in other database indexing, digital data processing, other database retrieval, etc., can solve the problem of inability to retain multivariate time series information, uneven data distribution, and difficulty in overcoming high-dimensional data indexing disasters, etc. question

Inactive Publication Date: 2017-11-07
SHANGHAI MUNICIPAL ELECTRIC POWER CO +2
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The current method based on space division retains the approximate information of the multivariate time series, but basically divides the space fixedly. For the index, the data distribution may not be uniform, which will reduce the query efficiency
Indexes based on feature compression find similar time series through dimensionality reduction, but usually cannot preserve the original multivariate time series information
Indexes based on metric spaces use lower-bound filtering techniques such as triangle inequality to improve query efficiency, but such indexes are still difficult to overcome the "dimension disaster problem" faced by high-dimensional data indexes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Geoword-based multivariate power time series data indexing method
  • Geoword-based multivariate power time series data indexing method
  • Geoword-based multivariate power time series data indexing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0028] Such as figure 1 As shown, the GeoWord encoding algorithm includes the following steps:

[0029] 1) Input: i-th track point and its corresponding discretization base {a i1 ,...,a ij ,...,a im}. a ij yes The discretization base of .

[0030] 2) Output: GeoWord encoding of the i-th track point

[0031] 3) g i =null / / Save GeoWord code

[0032] 4) for j from 1 to m / / encode m variables sequentially

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Geoword-based multivariate power time series data indexing method. The Geoword-based multivariate power time series data indexing method comprises the following steps of obtaining original multivariate power time series data, and performing dimension reduction on the original multivariate power time series data; conducting Geoword encoding on dimension-reduced multivariate power time series data, and obtaining an MTSAX expression of the multivariate power time series data; constructing MTSAX data indexes of a multivariate power time series according to the MTSAX expression. Compared with the prior art, the Geoword-based multivariate power time series data indexing method can conduct dynamic division, specifies the precision, has no overlapped nodes and the like.

Description

technical field [0001] The invention relates to the field of power data processing, in particular to a Geoword-based multivariate power time-series data indexing method. Background technique [0002] User electricity load data is a kind of massive time series data, which has the characteristics of large user scale, high data collection density, and close correlation with a large amount of economic and social data. Time series data indexing technology is very important to reduce the time cost of data query and retrieval and improve the efficiency of time series mining (such as classification, clustering, abnormal point monitoring, pattern discovery, etc.). Time series is a series of data arranged in chronological order. According to the number of variables contained in the time series, it can be divided into univariate time series and multivariate time series. The user's electricity load data contains multiple original information such as daily electricity consumption, volta...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/901Y02D10/00
Inventor 周向东王飞庞悦苏运郭乃网田英杰
Owner SHANGHAI MUNICIPAL ELECTRIC POWER CO
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products