Distributed B+ tree index system and building method

A construction method and technology of an indexing system, which are applied to the distributed B+ tree indexing system and construction field, can solve the problems of low query efficiency, lack of universal support for different index operations, lack of effective data indexing methods, etc., to achieve good scalability, improve Query efficiency, improve efficiency

Inactive Publication Date: 2009-11-11
PEKING UNIV
View PDF0 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing cloud storage systems such as Bigtable, Hbase, PNUTS, etc., all have some deficiencies in the indexing of massive data, mainly including: 1) The existing systems only have indexes on the primary key of the data, and lack of data indexing. Effective indexing methods for other attributes; if the client application server needs to query based on the retrieval conditions on some other attributes of the data, the system can only implement the query by traversing all the data, which makes the query efficiency very low
2) The existing system lacks an effective concurrency control transaction mechanism in index operations; such defects directly lead to the fact that the client application server must use its own code to ensure transaction consistency in a concurrent environment when using the index interface, which makes the client The development of the application server becomes complicated, and there is a lack of universal support for different index operations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed B+ tree index system and building method
  • Distributed B+ tree index system and building method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0017] The present invention provides a method and system for constructing a distributed B+ tree index based on a cloud storage environment, which can directly obtain the storage location information of webpages that meet user query conditions, thereby directly obtaining webpage data and returning it to users without the need for It traverses the collection of web pages saved in the entire system, which greatly improves the indexing efficiency. Among them, the B+ tree index is a data structure with high comprehensive efficiency for database indexing. Its characteristic is that all keywords appear in the linked list of leaf nodes, and the keywords in the linked list happen to be in order ;The search ends only when the leaf node is reached; the search process is equivalent to a binary search in the complete set of keywords; the non-leaf nodes are equiva...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a distributed B+ tree index system and a building method. The distributed B+ tree index system is characterized by comprising a master server, a transaction server cluster, an index server cluster and a version control server; the transaction server cluster comprises a plurality of transaction servers, and the index server cluster comprises a plurality of index servers; the master server is in charge of managing META data and conducts load balance scheduling on the index server cluster; the transaction server cluster is in charge of transaction control of index data access in a distributed file system; and the index server cluster is in charge of managing, reading and writing the index data in the distributed file system. The invention provides an index transaction mechanism with fine granularity and low network flow. As the basic transaction operation in the distributed B+ tree index system is based on key assignment granularity of a B+ tree and only dozens of bytes of B+ tree index key assignment pairs are needed to be transmitted when being the basic transaction is operated, the distributed B+ tree index system effectively realizes the transaction function of the index data in a concurrent environment.

Description

technical field [0001] The invention relates to a data index system and a construction method, in particular to a distributed B+ tree index system and a construction method used in a cloud storage environment. Background technique [0002] With the development of Internet technology and the advent of the Web2.0 era, due to the need to process and calculate massive data in the application process, such an application environment puts forward more requirements for the scalability, reliability and efficiency of the Web application data management system. high demands. With the maturity of cloud environment technology, cloud storage system shows more and more obvious advantages in today's web data management. [0003] Cloud storage refers to a system that integrates a large number of different types of storage devices in the network through application software to work together to provide data storage and business access functions through cluster applications, grid technology, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30H04L29/08
Inventor 汤成高军王腾蛟杨冬青
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products