Index maintenance method for supporting multiple data sources

An index maintenance and multi-data source technology, applied in the field of search engines, can solve problems such as delay, limited retrieval efficiency, and affecting users' real-time retrieval needs, so as to meet user requirements, ensure response time, and improve retrieval efficiency.

Inactive Publication Date: 2011-03-23
FUDAN UNIV
View PDF5 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. When indexing data on multiple data sources at the same time, the existing method needs to continuously update the index library, which causes delays in indexing, thus affecting the user's need for instant retrieval
[0006] 2. Since the update of the index is carried out on a large index library, during the update period, it i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Index maintenance method for supporting multiple data sources
  • Index maintenance method for supporting multiple data sources
  • Index maintenance method for supporting multiple data sources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] Three processes are maintained on the index server, which are data loading of sub-index libraries, merging of sub-index libraries, and processing of user retrieval requests. figure 2 , image 3 , Figure 4 It is a flow diagram of the present invention.

[0044] The index library is an independent directory, and the subdirectories contained in it are sub-index libraries, and the specific files in each sub-index library are different according to different index organization forms.

[0045] 1. The data loading process of the sub-index library

[0046] The index server sets a directory for receiving new data files, and executes according to the following processing flow:

[0047] (1) Check whether the directory has new data files.

[0048] (2) If there is no new data file, go to (1).

[0049] (3) If there is a new data file, execute the following processing flow:

[0050] (a) Create a corresponding subdirectory in the index library, and name the directory as: DATE1-...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of search engines, in particular to an index maintenance method for supporting multiple data sources. An entire index library is divided into a series of sub index libraries, and each sub index library stores indexes in certain time granularity and comprises an independent catalogue and relevant documents. The method comprises the following three operating steps of: loading data of the sub index libraries, combining the sub index libraries and processing user retrieval requests. The real-time updating of the index is conveniently realized by setting the sub index libraries; the coexistence of sub index libraries with different time granularities is realized by setting an appropriate index combination detecting period; the retrieval requests within a time range limited by users are mapped onto the sub index libraries, the index can be updated in independent sub index libraries without influencing the user retrieval requests, and thus, the response time is ensured to meet user requirements.

Description

technical field [0001] The invention belongs to the technical field of search engines, and in particular relates to a method for updating and maintaining an index database. Background technique [0002] Enterprise informatization produces a large amount of original information or processed information, such as various text information, multimedia information, etc. The information contains various contents that the user is interested in, and it is necessary to store and retrieve the information effectively. The main characteristics of this information retrieval system: First, there are many sources of data. After the original information is generated, it is required to enter the retrieval system as soon as possible and be retrieved; second, users have higher requirements for the response time of information retrieval. , especially the update process of the index database cannot affect the response time of user retrieval. The third is that different types of enterprise users...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 曾剑平吴承荣
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products