Text index online updating method in cloud environment

An update method and cloud environment technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as the inability of the system to provide services, avoid frequent updates and transmission of excessive redundant data, and ensure continuous sex, to achieve the effect of online update

Inactive Publication Date: 2011-04-06
TSINGHUA UNIV
View PDF3 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If the update time is long, it will inevitably cause ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text index online updating method in cloud environment
  • Text index online updating method in cloud environment
  • Text index online updating method in cloud environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The text index online update method in the cloud environment proposed by the present invention, its flow chart is as follows figure 2 shown, including:

[0031](1) After the user adds, deletes or updates a file to the text retrieval system, the identification information of the file is sent to the index module; the index module judges the received identification information according to the index segmentation rules defined in the text retrieval system, Determine the index slice to which the file belongs, and create an incremental data corresponding to the index slice for the file; the index module caches the incremental data, and adds, deletes, or updates the same index slice multiple times. When the user finishes adding, deleting or updating operations, the index module uploads all the incremental data of the index slices to the shared file system; the index module sends an index slice update command to the cluster master node in the text retrieval system, the The co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a text index online updating method in cloud environment, belonging to the technical field of computer information retrieval. After a user adds, deletes or updates a file in a text retrieval system, an index module creates incremental data of index sheets belonging to the file and merges multiple groups of incremental data of the same index sheet. A cluster main node selects first batch of nodes and second batch of nodes by sequencing child nodes according to load size and executes index updating by batch. After each batch of node receives an updating command, the retrieval service is firstly stopped, the read incremental data is merged to own index sheet, and the retrieval service is recovered. The cluster main node decides the time for starting using the retrieval service of first batch of nodes and updating the second batch of nodes according to the index service switching conditions set by the user. Finally, the cluster main node recovers the retrieval service of all nodes to complete updating. The method reduces the requirements of index updating on network bandwidth and computer resources and shortens index updating time.

Description

technical field [0001] The invention relates to an online updating method of a text index in a cloud environment, belonging to the technical field of computer information retrieval. Background technique [0002] The development of the Internet and enterprise informatization has produced a large amount of unstructured data, such as product models, technical documents, management texts, emails, etc. Text data is one of the most common unstructured data. In order to realize the storage, indexing and retrieval of massive data, many text retrieval systems adopt cloud computing solutions. Web text search engines are the most common type of applications that provide text retrieval services, such as Google and Nutch. [0003] In a cloud environment, index data is generally divided into many index slices, and then deployed in the cluster. Each node holds some of the index slices. Each index slice generally has multiple backups to ensure fault tolerance and load balancing. Many text...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 王建民丁贵广张君
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products