
Method and device for compressing high-dimensional time series with missing data while supporting similarity retrieval

A time-series similarity technology, applied in the field of compression coding, that addresses problems such as low accuracy, lack of support for stepped precision, and the inability to distinguish subtle differences between data sequences, while remaining simple and easy to implement.

Inactive Publication Date: 2019-12-13
清华大学山西清洁能源研究院 +1
Cites: 3, Cited by: 0

AI Technical Summary

Problems solved by technology

[0005] Disadvantages of iSAX: 1. It does not support missing data; 2. It assumes that all data points are normalized; 3. The encoding requires a special index structure; 4. Although binary codes are used to represent each cardinality space and support stepped precision, each segment is represented by its mean value, so accuracy is low; 5. A special data structure is required to support the index.
[0006] Disadvantages of Clipped Code: 1. It does not support stepped precision; 2. It does not support missing data; 3. It supports only a simple data model and Hamming distance calculation, so it cannot identify subtle differences between two data sequences.
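The third limitation listed for Clipped Code can be illustrated with a minimal sketch. It assumes the common form of the clipped representation, in which each point becomes 1 if it lies above the series mean and 0 otherwise; the function names `clip` and `hamming` and the sample values are hypothetical and do not come from the patent text.

```python
import math
from statistics import mean

def clip(series):
    """Clipped representation (assumed form): 1 where a value exceeds the series mean, else 0."""
    mu = mean(series)
    return [1 if x > mu else 0 for x in series]

def hamming(a, b):
    """Number of positions at which two equal-length bit sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))

# Two series with the same shape relative to their means but different amplitudes.
small = [1.0, 2.0, 3.0, 4.0]
large = [1.0, 2.0, 30.0, 40.0]

code_small = clip(small)   # [0, 0, 1, 1]
code_large = clip(large)   # [0, 0, 1, 1]

print(hamming(code_small, code_large))  # 0: the codes are identical
print(math.dist(small, large))          # 45.0: the raw series are far apart
```

Because each point is reduced to a single bit, any two series whose points fall on the same side of their respective means receive the same code, regardless of how far apart the values are.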



Examples


Embodiment Construction

[0028] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0029] The method and device for compressing high-dimensional time series with missing data while supporting similarity retrieval according to the embodiments of the present invention are described below with reference to the accompanying drawings. The compression method is described first.

[0030] Figure 1 is a flowchart of a method for compressing hi...



Abstract

The invention discloses a method and device for compressing high-dimensional time series with missing data while supporting similarity retrieval. The method comprises the following steps: collecting a high-dimensional data sequence array and encoding it with a binary stepped identification; performing compression conversion on the high-dimensional time series of the array and determining the missing data points; and storing the stepped, high-compression-ratio data at different accuracies in either a clustered or an unclustered storage mode. The method completes rapid compression conversion of a high-dimensional time series, supports recording of missing data points, supports stepped high-compression-ratio storage at different accuracies, and supports a binary index technology widely used in the field of graphs. It is simple and easy to implement.
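The abstract does not specify the encoding itself, so the following is only an illustrative sketch of how binary codes with stepped precision and a missing-point record could fit together. It assumes uniform quantization over a known value range and treats a coarser step as the leading bits of a finer code; the names `encode` and `coarsen`, the range bounds, and the sample data are all hypothetical and are not taken from the patent.

```python
def encode(series, lo, hi, bits=8):
    """Quantize each present value to a `bits`-bit integer code.

    Missing points (None) get a placeholder code of 0 and are flagged in a
    separate boolean mask, so they can be told apart from real zeros.
    """
    levels = (1 << bits) - 1
    codes, missing = [], []
    for x in series:
        if x is None:
            codes.append(0)
            missing.append(True)
        else:
            clamped = min(max(x, lo), hi)  # keep values inside [lo, hi]
            codes.append(round((clamped - lo) / (hi - lo) * levels))
            missing.append(False)
    return codes, missing

def coarsen(codes, bits, keep):
    """Keep only the `keep` most significant bits of each `bits`-bit code."""
    return [c >> (bits - keep) for c in codes]

# Hypothetical sensor readings with one missing sample.
readings = [0.0, 2.5, None, 10.0]
codes, missing = encode(readings, lo=0.0, hi=10.0, bits=8)

print(codes)                 # [0, 64, 0, 255]
print(missing)               # [False, False, True, False]
print(coarsen(codes, 8, 2))  # [0, 1, 0, 3]
```

In this sketch a single stored code serves several precisions: reading fewer leading bits gives a coarser, smaller representation, while the mask keeps the missing positions explicit at every step.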

Description

Technical field

[0001] The invention relates to the technical field of compression coding, and in particular to a method and device for compressing high-dimensional time series with missing data while supporting similarity retrieval.

Background

[0002] Data collected by an industrial Internet of Things system carries time tags, so the data collected within a given period forms a high-dimensional data sequence array. Processing on a big data platform involves data acquisition, storage, and retrieval. At present, big data platforms handle time series with traditional database technology, which supports operations on individual "points" in the series, such as finding the maximum value in a time series or taking the mean over a time period. However, current artificial intelligence algorithms rely on the underlying data system being able to compare whole time series for similarity.

[0003] Similar to this technique are t...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/2458, G06F16/22, G06F16/215, G06F16/2453
CPC: G06F16/215, G06F16/22, G06F16/2228, G06F16/24532, G06F16/2474
Inventor: 张亮
Owner: 清华大学山西清洁能源研究院