Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Index system and index method for retrieving time sequences of ultra-large-scale data

A retrieval index, ultra-large-scale technology, applied in the field of data processing, can solve problems such as inability to return, inability to cope with technical challenges, performance and efficiency problems, and achieve the effect of improving speed

Active Publication Date: 2017-05-31
SOUTH CHINA NORMAL UNIVERSITY
View PDF12 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] As mentioned above, the time-series indexes of various time-series databases can effectively solve various time-series data management problems in time-series databases. However, they have a common feature that they are all designed for traditional relational time-series databases. They are designed to deal with conventional-scale data volumes, usually millions of data, and they cannot cope with the technical challenges brought about by the ultra-large-scale data of more than 1 billion in the current big data era
When the total amount of data in the data set to be processed becomes larger and larger, the above-mentioned time series index of the time series database will have serious performance and efficiency problems, making it impossible to return effective time series retrieval results within an acceptable time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Index system and index method for retrieving time sequences of ultra-large-scale data
  • Index system and index method for retrieving time sequences of ultra-large-scale data
  • Index system and index method for retrieving time sequences of ultra-large-scale data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The specific embodiment of the present invention will be further described below in conjunction with accompanying drawing:

[0035] refer to figure 1 , a time-series indexing system for ultra-large-scale data, including:

[0036] The vertical level index module includes a multi-layer index, and each level index includes a hash function and multiple data sets. The original data is mapped to the data set through the hash function of the first level index, and the data in the data set is passed through The hash function of the index of the next level is mapped to the data set of the next level;

[0037] The time axis index module is used to establish an event list and a time list for the data of the data set in the lowest-level index; the event list is used to record the activation status of the event corresponding to the data at a certain fixed point time, and the time list is used for Records the total number of events that occurred before a certain point in time.

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an index system and an index method for retrieving time sequences of ultra-large-scale data. The index system comprises a vertical hierarchical index module and a corresponding time axis index module. The vertical hierarchical index module comprises a plurality of hierarchies of indexes, each hierarchy of indexes comprises a hash function and a plurality of data sets, and data in the data sets are mapped into a corresponding next hierarchy of data set by the aid of the hash function of the corresponding next hierarchy of indexes; the time axis index module is used for creating event lists and time lists. The index system and the index method have the advantages that original big data sets can be ultimately distributed into a plurality of small data sets by means of hierarchy -by- hierarchy hash mapping by the aid of the hierarchical index module, operation such as query processing, data loading and storage optimization can be independently executed on each small data set, accordingly, the vertical hierarchical index module can be combined with the time axis index module, risks of full-table scanning operation in time sequence retrieval operation procedures can be prevented, and the time sequence retrieval speeds can be greatly increased; the index system and the index method for retrieving the time sequences of the ultra-large-scale data can be widely applied to the field of data processing.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a time-series indexing system and method for ultra-large-scale data. Background technique [0002] The field of time-series data management of time-series database also involves the time-series retrieval operation on the data in the database. Internally, the time-series database also implements various time-series data management functions efficiently by establishing a data index for the time-series data. In general, these data indexes in the time series database are mainly divided into two categories, one is the index based on the B+ tree structure, and the other is the index based on the R tree structure. For example, there are several specific index structures such as Time Index, Snapshot Index, CheckpointIndex, Archivable Time Index, Overlapping B+ tree, etc. [0003] Timeline Index is an index structure proposed by Martin Kaufman et al. in 2013. It mainly serves the time-ser...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/2272G06F16/2282
Inventor 赵淦森李振宇王欣明张海明庄序填唐华李卓越林成创刘创辉马朝辉廖智锐
Owner SOUTH CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products