High-availability distribution type full-text index method

A full-text index and distributed technology, applied in the field of query services, can solve problems such as inability to support different types of data index operations, index failure to recover normally, increase query time and network overhead, etc.

Inactive Publication Date: 2014-11-12
ZHEJIANG UNIV
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, some other distributed index systems divide the index according to a fixed size. During the query process, all index fragments need to be queried, which increases the query time and network overhead.
[0006] 3. Generally speaking, distributed indexing systems are designed to meet specific needs and cannot support dynamic indexing operations on different types of data
When a node in the index cluster fails, use the backups of other nodes in the system to restore it. However, if these backup nodes fail at the same time, the index on the failed node cannot be restored normally.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-availability distribution type full-text index method
  • High-availability distribution type full-text index method
  • High-availability distribution type full-text index method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] The main purpose of the present invention is to propose a method for establishing a distributed full-text index system. The distributed full-text indexing system provides massive text indexing and query services to the outside world. The present invention will be fully and detailedly described below with reference to the accompanying drawings. like figure 1 As shown, a distributed full-text index system can be constructed through a high-availability distributed full-text index method used in the present invention. A complete distributed full-text indexing method should consist of the following steps:

[0063] 1. Start the basic service system, including distributed file system, distributed columnar database and metadata directory service. The distributed file system can use Hadoop's distributed file system HDFS, the distributed database can use HBase, and the metadata directory service can be implemented using Zookeeper. All the above-mentioned systems can be replac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a high-availability distribution type full-text index method. The method comprises the following steps of: firstly, starting a basic service system, and then starting an index cluster service and an inquiry cluster service on each node; establishing, updating and deleting an index on full-text data; and finally, inquiring the index. According to the high-availability distribution type full-text index method, an inquiring and indexing process can be separated and the increment or batch type establishment of existing indexes can be simultaneously supported; the batch type indexes can be used for establishing an index for large-scale data in short time; and the increment type indexes avoid reestablishing the index. Index files can be divided into three layers of structures comprising an index file, an index fragment and an index sub-fragment, thus enhancing the expandability and the availability of the index file. According to the high-availability distribution type full-text index method, a dynamic index task configuration task is provided; and parameters in the index task are arranged to dynamically meet requirements of establishing the index by a user through different types of data.

Description

technical field [0001] The invention relates to the field of information indexing and searching, and more specifically, relates to a method for constructing a distributed full-text index for massive text data and providing highly available query services. Background technique [0002] With the development of the Internet, especially the emergence of Web2.0, the amount of text information is increasing exponentially. Users want to be able to effectively manage massive text data, and quickly search these texts to obtain corresponding information. [0003] The emergence of search engines such as Google, Baidu, and Bing has met the needs of users for information search. The core technology used by search engines is to collect various data information in the network through web crawlers, index these data, and then provide query services to the outside world. As the data information continues to grow, the size of the created index files also increases, resulting in the stand-alo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 陈岭鲁伟明余斌
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products