Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation

A similarity measurement and time series technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of weak expression ability of time series fluctuation patterns, low scalability of data scale, high computational complexity, etc., and achieve high Local pattern matching accuracy, high global pattern matching accuracy, and the effect of overcoming phase shift

Inactive Publication Date: 2015-08-05
ZHEJIANG UNIV
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such methods include segmented aggregation approximation, segmented linear approximation, symbolic aggregation approximation, singular value decomposition, principal component analysis, etc. The first three methods need to segment the original time series first, and then process each sub-segment separately : The segmented aggregation approximation is to calculate the average value of each segment; the segmented linear approximation is to do line segment fitting on each segment; The extracted features are relatively single, which makes it less capable of expressing time series fluctuation patterns
Singular value decomposition and principal component analysis are realized by performing a unified eigenmatrix decomposition on all time series; the typical defects of these two types of methods are that they have high computational complexity, and the decomposition process can only be done in memory, and the data Very low scalability at scale

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation
  • Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation
  • Time sequence similarity measurement method based on self-adaptive piecewise statistical approximation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0036] Such as figure 1 Shown, a kind of time series similarity measurement method based on adaptive subsection statistical approximation of the present invention, comprises the following steps:

[0037] (1) Adaptive segmentation, such as figure 2 As shown, it specifically includes the following sub-steps:

[0038] (1.1) Read the original time series T={t 1 ,t 2 ,...,t i ,...,t n} and Q={q 1 ,q 2 ,...,q i ,...,q n};

[0039] (1.2) For time series T and Q, calculate the average value m' and standard deviation σ' of the sampling points of T and the average value m' and standard deviation σ' of the sampling points of Q, respectively, according to the formula (1) for T and Q performs Z-normalization processing to obtain a normalized time series T'={t' 1 ,t' 2 ,...,t' i ,...,t' n} and Q'={q' 1 ,q' 2 ,...,q' i ,...,q' n};

[0040] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a time sequence similarity measurement method based on self-adaptive piecewise statistical approximation. The method comprises the following steps of firstly, segmenting a time sequence into subsequences containing complete fluctuation trends based on time sequence coded identification turning points; secondly, extracting various statistical characteristics of each subsequence in sequence so as to configure local pattern character vectors; and lastly, computing a distance between the local pattern character vectors by utilizing a normalized distance so as to realize local pattern matching, and using the local pattern matching as a subprogram of a dynamic programming algorithm so as to realize global pattern matching. The time sequence similarity measurement method is better than the other measurement method in the aspects of measurement precision and computational efficiency to a larger extent, and plays an important role in daily activities and industrial production of people, such as similarity search, classification, clustering, predication, anomaly detection, on-line pattern recognition and the other processing of large-scale sampling data or high-speed dynamic data flow in banking transaction, traffic control, air quality and temperature monitoring, industrial flow monitoring, medical diagnosis and the other application.

Description

technical field [0001] The present invention relates to the fields of database, data mining, machine learning, information retrieval, etc., and especially relates to time series data analysis and mining. Background technique [0002] Time series widely exist in people's daily life and industrial production, such as real-time transaction data of funds or stocks, daily sales data in the retail market, sensor monitoring data in the process industry, astronomical observation data, aerospace radar, satellite monitoring data, real-time Weather temperature and air quality index, etc. The industry has proposed many time series analysis methods so far, including similarity query methods, classification methods, clustering methods, forecasting methods, anomaly detection methods, etc. Among them, many methods need to judge the similarity of time series, such as kNN classifier, k-means clustering method, etc. Therefore, the time series similarity measurement method has a wide range of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 蔡青林陈岭孙建伶陈蕾英
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products