Multidimensional interval querying method and system thereof

A query method and query system technology, applied in the field of computer processing, can solve the problems of storage overhead growth, slow speed of multi-dimensional interval query, long response time cannot meet the real-time retrieval of massive data, etc., and achieve the effect of storage overhead maintenance

Active Publication Date: 2010-10-20
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF4 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method will generate random read operations when retrieving data tables, and the speed of random read operations in distributed sequential tables is an order of magnitude lower than that of sequential scans (scan), so under this index mechanism, the multi-dimensional range query The speed is slow, and the response time is too long to meet the needs of current network applications for real-time retrieval of massive data
If you do not build a secondary index, but build a clustered index for the index column to improve query performance, the storage overhead will increase exponentially due to the copy of the underlying file system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multidimensional interval querying method and system thereof
  • Multidimensional interval querying method and system thereof
  • Multidimensional interval querying method and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The flow of the multi-dimensional interval query method of the present invention is as follows: figure 1 shown.

[0041] Step S100, organize the copies used for backup data into multiple complementary clustered index tables that complement each other and verify each other; the complementary clustered index table creates a table with the column value plus the primary key of the original row plus the column value for each index column The length is the sequence table of the new primary key, and the data of the remaining columns in the original row is completely stored; the complementary clustered index table is used for row continuous scanning during query.

[0042] In a preferred implementation manner, in step S100, the complementary clustered index table includes all data in the original data table; the backup strategy of the underlying file system is in a closed state.

[0043] In step S200, the query string is converted into a query plan tree, and query execution is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a multidimensional interval querying method and a system thereof. The method comprises the following steps of: 1. organizing a transcript for copying data to be a plurality of complementary cluster index tables which are mutually compensated and verified, wherein each complementary cluster index table builds a sequence table which takes the length of a column value, an original line primary key and a column value as a primary key by means of each index column, completely stores the data of the other columns in the original line, and is used for continuously scanning when querying; and 2. converting a query string into a query plane tree, and performing query optimization to complete the query. The invention can simultaneously meet the requirements of high performance, low storage cost and high reliability.

Description

technical field [0001] The invention relates to the field of computer processing, in particular to a multi-dimensional interval query system and method. Background technique [0002] Resource discovery requires queries with multiple conditions, most of which can be converted into multidimensional range queries. In fact, multi-dimensional interval query is a basic requirement of network applications. A simple example is the application of storing Internet image information. The designer may need to query the top 100 images with the highest number of clicks within a period of time. This query involves multiple attributes such as time and clicks (the columns of the data table, also called dimension, dimension) interval. [0003] As the amount of network application data continues to increase, it is difficult for existing methods to meet the requirements of high performance, low storage overhead and high reliability at the same time. By classifying the existing data models, i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 邹永强刘佳查礼王世才
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products