Multi-dimensional interval query method and system based on cloud computing

A query method and query system, applied in the field of multi-dimensional indexing of massive data, that address the problem that existing systems cannot simultaneously meet the multi-dimensional interval query requirements of collection-type big data, fast write capability, and dynamic scalability. The approach achieves good scalability, improves query speed, and eliminates repeated reads.

Status: Inactive
Publication Date: 2014-03-26
INST OF COMPUTING TECH CHINESE ACAD OF SCI
Cites: 4
Cited by: 64

AI Technical Summary

Problems solved by technology

[0017] The technical problem to be solved by the present invention is to overcome the defect that existing systems cannot simultaneously provide multi-dimensional interval queries, fast writes, and dynamic scalability for collection-type big data. To that end, the invention proposes a multi-dimensional interval query method and system based on cloud computing.

Examples

Embodiment Construction

[0090] The present invention is described in detail below with reference to the accompanying drawings and specific embodiments, which are not to be taken as limiting the invention.

[0092] The invention provides a multi-dimensional interval query method and system for collection-type big data. It combines the high throughput and good scalability of the HDFS storage system with the multi-dimensional interval indexing capability of Grid File, and stores the index, DGFIndex, in a DHT-based Key/Value database so that queries are distributed evenly across the server cluster, thereby providing multi-dimensional interval query capability directly on HDFS.
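The idea of a grid-style index kept in a key/value store can be illustrated with a small sketch. This is not the patented DGFIndex implementation: the class name `GridIndex`, the fixed split points, the in-memory dict standing in for the DHT-based Key/Value database, and the string "locations" standing in for HDFS block offsets are all assumptions made for illustration.

```python
from bisect import bisect_right
from collections import defaultdict
from itertools import product


class GridIndex:
    """Toy grid-file-style index: grid cell coordinates -> record locations."""

    def __init__(self, splits):
        # splits[d] is the sorted list of split points for dimension d
        # (assumed to be chosen in advance).
        self.splits = splits
        # A plain dict stands in for the DHT-based key/value store.
        self.kv = defaultdict(list)

    def _cell(self, point):
        # Cell coordinate in each dimension = number of split points <= value.
        return tuple(bisect_right(s, v) for s, v in zip(self.splits, point))

    def insert(self, point, location):
        # Store the point with its (hypothetical) storage location under its cell key.
        self.kv[self._cell(point)].append((point, location))

    def query(self, lows, highs):
        """Locations of records with lows[d] <= point[d] <= highs[d] for every d."""
        lo_cell = self._cell(lows)
        hi_cell = self._cell(highs)
        ranges = [range(a, b + 1) for a, b in zip(lo_cell, hi_cell)]
        hits = []
        # Only cells overlapping the query box are visited.
        for cell in product(*ranges):
            for point, location in self.kv.get(cell, []):
                # Boundary cells may hold points outside the box, so filter them.
                if all(l <= v <= h for l, v, h in zip(lows, point, highs)):
                    hits.append(location)
        return hits
```

Because each cell is an independent key, a real key/value store could spread the cells across servers, which is the property the paragraph above relies on for balanced query distribution.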

[0093] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail b...

Abstract

The invention provides a multi-dimensional interval query method based on cloud computing. The method comprises an index construction step and a multi-dimensional interval data query step. The index construction step includes automatically constructing and storing a distributed grid file index and metadata according to an externally entered index construction command. The query step includes locating the data blocks to be read according to an externally entered query command and on the basis of the distributed grid file index and metadata; automatically and evenly distributing read requests to the nodes of a server cluster; processing the query requests in parallel within the system; acquiring all query results in parallel; and collecting the results before returning them to the user. The invention further provides a multi-dimensional interval query system based on the distributed grid file index.
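The query step described above (locate blocks, spread read requests evenly, process in parallel, collect) can be sketched as follows. This is a minimal illustration under stated assumptions, not the system's actual scheduler: round-robin assignment is one possible "even" policy, threads stand in for cluster nodes, and `storage`, `assign_round_robin`, `read_on_node`, and `run_query` are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor


def assign_round_robin(blocks, nodes):
    """Spread block read requests evenly over nodes (assumed policy)."""
    plan = {node: [] for node in nodes}
    for i, block in enumerate(blocks):
        plan[nodes[i % len(nodes)]].append(block)
    return plan


def read_on_node(node, blocks, storage, predicate):
    """Stand-in for one node scanning its assigned blocks and filtering records."""
    return [rec for b in blocks for rec in storage[b] if predicate(rec)]


def run_query(blocks, nodes, storage, predicate):
    """Distribute reads, run them in parallel, and collect all partial results."""
    plan = assign_round_robin(blocks, nodes)
    with ThreadPoolExecutor(max_workers=len(nodes)) as pool:
        futures = [
            pool.submit(read_on_node, node, assigned, storage, predicate)
            for node, assigned in plan.items()
        ]
        results = []
        for future in futures:
            results.extend(future.result())
    return results
```

Here `blocks` would be the list of data blocks that the index and metadata identified for the query, and `predicate` the interval condition applied to each record.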

Description

Technical field

[0001] The invention relates to the field of multi-dimensional indexing of massive data, and in particular to an indexing technology for multi-dimensional interval queries over massive data.

Background

[0002] In the smart grid, massive volumes of collected data need to be stored efficiently and queried quickly. Compared with the big data generated in the Internet field, such as by social networks and search engines, collected big data has distinctive characteristics: (1) high collection frequency; (2) a very large number of collection terminals generating massive data; (3) a fixed number of fields in each collected record; (4) distinct spatial and temporal characteristics. Queries over collected data also have distinctive characteristics: (1) multi-dimensional interval queries are prominent; (2) the query dimensions are generally fixed; (3) queries for aggregate values (such as the total number of records, Sum, Max, Min, etc.) account for a large proportion of queries...
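To make the query shape concrete, the sketch below shows what a multi-dimensional interval query with aggregate values computes. It is a naive full scan used only as a reference point, not the indexed method of the invention; the function name `interval_aggregate` and the field names in the usage note are assumptions.

```python
def interval_aggregate(records, bounds, value_field):
    """Count/Sum/Max/Min of value_field over records inside every interval.

    bounds maps a dimension name to an inclusive (low, high) interval.
    """
    matched = [
        r[value_field]
        for r in records
        if all(lo <= r[dim] <= hi for dim, (lo, hi) in bounds.items())
    ]
    if not matched:
        return {"count": 0, "sum": 0, "max": None, "min": None}
    return {
        "count": len(matched),
        "sum": sum(matched),
        "max": max(matched),
        "min": min(matched),
    }
```

For example, with records carrying hypothetical fields `meter_id`, `ts`, and `power`, calling `interval_aggregate(records, {"meter_id": (100, 200), "ts": (10, 20)}, "power")` aggregates `power` over the records that fall inside both intervals. A full scan like this reads every record, which is the cost a multi-dimensional index is meant to avoid.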

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, H04L29/08
CPC: G06F16/2272
Inventors: 刘越, 虎嵩林, 李彦虎, 刘万涛, 陈建, 李祥珍, 吴凯峰, 王志强, 张春光, 裴旭斌, 肖政, 崔蔚
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI