Real-time data storage and query method

A real-time data and query method technology, applied in the computer field, can solve the problems of long time spent in real-time data query, large database memory space, lack of index support for data, etc., achieve novel and efficient index organization structure, improve query performance, Improving the effect of real-time performance

Inactive Publication Date: 2015-07-22
RENMIN UNIVERSITY OF CHINA
View PDF9 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Specifically, the method of storing and reading real-time data using databases and data warehouses is mainly completed by building B+ trees or bitmap indexes in traditional databases, but this method occupies a relatively large amount of database memory space when storing real-time data. And it takes a long time to query real-time data, and it is difficult to meet the performance requirements of large-scale real-time data storage and query
The method of data storage based on Hadoop/Spark distributed file system i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time data storage and query method
  • Real-time data storage and query method
  • Real-time data storage and query method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0026] With the rapid development and popularization of Internet information technology, the scale of industrial application systems has rapidly expanded, and there have been data sets that cannot be acquired, managed, and processed within a certain period of time using traditional software technologies and tools. Data sets with this feature Colle...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a real-time data storage and query method. The method comprises the steps that when storage is conducted, partition is conducted on data to be stored by adopting consistency Hash sharding, primary index data carried with index information of the consistency Hash sharding are obtained from at least one distributed node of a distributed storage system, merger processing is conducted on at least two primary index data of one same distributed node to form a distributed file block corresponding to the distributed node, and a clustered index is established for the primary index data in the distributed file block; secondary index data carried with clustered index information are obtained from the distributed file block corresponding to the distributed node, and the secondary index data in the distributed file block are written in a disk in which the corresponding distributed node is located; when query is conducted, by means of the index information of the consistency Hash sharding established when the data to be inquired are stored and the clustered index information established in the distributed file block, a needed result can be located accurately and rapidly.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a real-time data storage and query method. Background technique [0002] In recent years, with the rapid development and popularization of computer and information technology, the scale of industrial application systems has expanded rapidly, and the data generated by industrial applications has grown explosively. Since the value of data is positively correlated with real-time performance, that is, the newer the data, the more valuable it is. Therefore, how to effectively store, analyze, and process increasing large-scale real-time data is a hot research topic now. [0003] At present, traditional methods mainly use databases and data warehouses or distributed file systems based on Hadoop / Spark to complete the storage and reading of large-scale real-time data, and then realize the analysis and processing of real-time data. Specifically, the method of storing and reading real-ti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 陈跃国杜小勇覃雄派卞昊穹程鳌赵丽萍
Owner RENMIN UNIVERSITY OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products