Index generation method and index generation device based on MapReduce programming architecture
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- XIAMEN MEIYA PICO INFORMATION
- Publication Date
- 2012-04-25
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The present invention relates to the field of Internet information technology, and more specifically, to an index generation method and device based on a MapReduce programming architecture. Background technique
[0002] Traditional index engines (such as: lucene and lucene-based Solr) create and manage indexes in a file-based way, which has many disadvantages:
[0003] 1. It is not suitable for storage in the cluster file system, because the index files are still large batches of small files, and placing them on the cluster file system will greatly reduce the read and write efficiency;
[0004] 2. It is not easy to perform cluster expansion. When a large amount of data is indexed at the same time, due to the frequent creation and merging of new fragments, it is easy to increase the response time of the index engine and reduce the efficiency.
[0005] Based on the above analysis, it can be found that it is imperative to introduce an external management...