Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Time-series similarity measurement method based on segmented statistical approximate representation

A similarity measurement and time series technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of high computational complexity, single feature, weak expression ability of time series fluctuation patterns, etc., and achieve high global pattern matching accuracy , high local pattern matching accuracy, and the effect of overcoming the phase shift problem

Inactive Publication Date: 2015-03-25
ZHEJIANG UNIV
View PDF2 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such methods include segmented aggregation approximation, segmented linear approximation, symbolic aggregation approximation, singular value decomposition, principal component analysis, etc. The first three methods need to segment the original time series first, and then process each sub-segment separately (Segmented aggregation approximation is to calculate the average value of each segment, segmented linear approximation is to perform line segment fitting on each segment, and symbolic aggregation approximation is to discretize the average value of each segment into symbols based on segmental aggregation approximation), because The extracted features are relatively simple, which makes it less capable of expressing time series fluctuation patterns
Singular value decomposition and principal component analysis are implemented by performing a unified characteristic matrix decomposition on all time series. The typical defects of these two types of methods are high computational complexity, and the decomposition process can only be completed in memory, and the scalability of data scale is very low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Time-series similarity measurement method based on segmented statistical approximate representation
  • Time-series similarity measurement method based on segmented statistical approximate representation
  • Time-series similarity measurement method based on segmented statistical approximate representation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0026] Such as figure 1 Shown, a kind of time series similarity measurement method based on segmental statistical approximate representation of the present invention, comprises the following steps:

[0027] (1) Feature extraction, such as figure 2 As shown, it specifically includes the following sub-steps:

[0028] (1.1) Read the original time series T={t 1 ,t 2 ,...,t i ,...,t n} and Q={q 1 ,q 2 ,...,q i ,...,q n};

[0029] (1.2) For the time series T and Q, calculate the average m' and standard deviation σ' of the sampling points of T and the average m' and standard deviation σ' of the sampling points of Q respectively, according to the formula (1) for T and Q performs Z-normalization processing to obtain a normalized time series T'={t' 1 ,t′ 2 ,…,t′ i ,…,t′ n} and Q'={q' 1 ,q′ 2 ,…,q′ i ,…,q′ n};

[0030] t ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a time-series similarity measurement method based on segmented statistical approximate representation. The method comprises the steps of feature extraction and dynamic pattern matching. First, a time series is segmented into sub series, the various statistical features of the sub series are sequentially extracted, and local pattern feature vectors are constructed; then the distance between the local pattern feature vectors is calculated by the weighted Euclidean distance, local pattern matching is achieved, the matched local pattern is used as the sub program of a dynamic programming algorithm, and global pattern matching is achieved. The method is superior to other measurement methods by a large degree on the aspects of measurement precision and calculation efficiency, and plays an important role in daily activities and industrial production of people, for example, financial transactions, traffic control, air quality and temperature monitoring, industrial flow monitoring, medical diagnosis and the like. Large scale sampling data or high-speed dynamic data flow is subjected to similarity-based search, classification, clustering, prediction, abnormal detection, on-line pattern recognition and the like.

Description

technical field [0001] The invention relates to the fields of database, data mining, machine learning, information retrieval and the like, and in particular to a time series similarity measurement method based on segmental statistical approximate representation. Background technique [0002] Time series widely exist in people's daily life and industrial production, such as real-time transaction data of funds or stocks, daily sales data in the retail market, sensor monitoring data in the process industry, astronomical observation data, aerospace radar, satellite monitoring data, real-time Weather temperature and air quality index, etc. The industry has proposed many time series analysis methods so far, including similarity query methods, classification methods, clustering methods, forecasting methods, anomaly detection methods, etc. Among them, many methods need to judge the similarity of time series, such as kNN classifier, k-means clustering method, etc. Therefore, the tim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2462
Inventor 蔡青林陈岭孙建伶陈蕾英
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products