Efficient mass data indexing method in cloud computing

A massive data, cloud computing technology, applied in the field of cloud computing, can solve problems such as low access rate

Inactive Publication Date: 2015-03-04
LANGCHAO ELECTRONIC INFORMATION IND CO LTD
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] The technical problem to be solved by the present invention is: in view of the massive and distributed characteristics of data in the cloud computing environment, and the problem of low access rate in the existing distributed B-tree index method, the present invention proposes a cloud computing environment The high-efficiency indexing method for massive data uses logs to record the split history of nodes on the basis of distributed B-trees, and accesses distributed B-trees efficiently and concurrently based on the split history of nodes, which effectively improves the massive and distributed data in the cloud computing environment. Data Access and Indexing Efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Efficient mass data indexing method in cloud computing
  • Efficient mass data indexing method in cloud computing
  • Efficient mass data indexing method in cloud computing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Below according to accompanying drawing of description, in conjunction with specific embodiment, the present invention is further described:

[0036] A high-efficiency indexing method for massive data in cloud computing. The method uses logs to record the split history of nodes on the basis of distributed B-trees, and efficiently and concurrently accesses distributed B-trees based on the split history of nodes, which effectively improves cloud computing. Massive, distributed data access and indexing efficiency in the environment.

[0037] Described method specific content is as follows:

[0038] Each server has a B-tree node split log, which is used to record the split history of all B-tree nodes distributed in the server;

[0039] The splitting history is a record file sorted in chronological order. Its structure is shown in Figure 1. Each splitting of a node is recorded in the log as a record;

[0040]The record structure in the log is: , where LowValue and UpValue ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an efficient mass data indexing method in cloud computing. According to the method, the node splitting history is recorded by use of a log on the basis of a distributed B-tree and the distributed B-tree is efficiently and simultaneously accessed based on the node splitting history, and therefore, the accessing and indexing efficiency of mass distributed data in the cloud computing environment can be effectively improved. The efficient mass data indexing method in the cloud computing is superior to an affair method because distributed locks need to be added to all nodes in the path from the root to a leaf node to be accessed in the affair method, and the efficient mass data indexing method only needs to lock the leaf node to be accessed; the efficient mass data indexing method is superior to a link method because the number of nodes to be traversed in the efficient mass data indexing method is smaller than that of the link method.

Description

technical field [0001] The invention relates to the field of cloud computing, in particular to an efficient indexing method for massive data in cloud computing. Background technique [0002] At present, a large number of Internet applications based on massive data and providing various information services have emerged in the cloud computing environment. [0003] The data of these applications is characterized by massive volume and rapid growth. The system distributes all data to multiple storage nodes by hashing the key of each data, so as to realize the scalable storage of rapidly growing massive data. The hash method has higher efficiency for query using keywords, but it cannot improve the efficiency of query using non-keywords, and does not support range queries. For users, in addition to using keywords to query data, they also like to use other attributes to query data or perform range queries. For example, in an online video system (such as Youtube), each video cont...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/24553G06F16/2246
Inventor 杨晋博尹艳艳张新玲
Owner LANGCHAO ELECTRONIC INFORMATION IND CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products