Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation

A similarity measurement and time series technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of weak expression ability of time series fluctuation patterns, low scalability of data scale, high computational complexity, etc., and achieve high Local pattern matching accuracy, high global pattern matching accuracy, and the effect of overcoming phase shift

A similarity measurement and time series technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of weak expression ability of time series fluctuation patterns, low scalability of data scale, high computational complexity, etc., and achieve high Local pattern matching accuracy, high global pattern matching accuracy, and the effect of overcoming phase shift

CN104820673AInactive Publication Date: 2015-08-05ZHEJIANG UNIV

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation
  • Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation
  • Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0036] Such as figure 1 Shown, a kind of time series similarity measurement method based on adaptive subsection statistical approximation of the present invention, comprises the following steps:

[0037] (1) Adaptive segmentation, such as figure 2 As shown, it specifically includes the following sub-steps:

[0038] (1.1) Read the original time series T={t 1 ,t 2 ,...,t i ,...,t n} and Q={q 1 ,q 2 ,...,q i ,...,q n};

[0039] (1.2) For time series T and Q, calculate the average value m' and standard deviation σ' of the sampling points of T and the average value m' and standard deviation σ' of the sampling points of Q, respectively, according to the formula (1) for T and Q performs Z-normalization processing to obtain a normalized time series T'={t' 1 ,t' 2 ,...,t' i ,...,t' n} and Q'={q' 1 ,q' 2 ,...,q' i ,...,q' n};

[0040] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a time sequence similarity measurement method based on self-adaptive piecewise statistical approximation. The method comprises the following steps of firstly, segmenting a time sequence into subsequences containing complete fluctuation trends based on time sequence coded identification turning points; secondly, extracting various statistical characteristics of each subsequence in sequence so as to configure local pattern character vectors; and lastly, computing a distance between the local pattern character vectors by utilizing a normalized distance so as to realize local pattern matching, and using the local pattern matching as a subprogram of a dynamic programming algorithm so as to realize global pattern matching. The time sequence similarity measurement method is better than the other measurement method in the aspects of measurement precision and computational efficiency to a larger extent, and plays an important role in daily activities and industrial production of people, such as similarity search, classification, clustering, predication, anomaly detection, on-line pattern recognition and the other processing of large-scale sampling data or high-speed dynamic data flow in banking transaction, traffic control, air quality and temperature monitoring, industrial flow monitoring, medical diagnosis and the other application.

Description

technical field [0001] The present invention relates to the fields of database, data mining, machine learning, information retrieval, etc., and especially relates to time series data analysis and mining. Background technique [0002] Time series widely exist in people's daily life and industrial production, such as real-time transaction data of funds or stocks, daily sales data in the retail market, sensor monitoring data in the process industry, astronomical observation data, aerospace radar, satellite monitoring data, real-time Weather temperature and air quality index, etc. The industry has proposed many time series analysis methods so far, including similarity query methods, classification methods, clustering methods, forecasting methods, anomaly detection methods, etc. Among them, many methods need to judge the similarity of time series, such as kNN classifier, k-means clustering method, etc. Therefore, the time series similarity measurement method has a wide range of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
05 Aug 2015
Publication
CN104820673A
IPC
G06F17/30
Inventors
蔡青林; 陈岭