Accessing identification index system and accessing identification index library generation method
A technology of access identification and indexing system, which is applied in the field of access identification and indexing system, can solve problems such as limiting the accuracy of websites, and achieve satisfactory service results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0052] Such as figure 1 As shown, it includes a server module connected to the network, a log analysis module and an index module. The server module can be implemented through a standard open source module such as an apache module, or through other server modules.
[0053]The log analysis module is used to process the access log files from the server module, generate incremental index data, and transmit the incremental index data to the index module. The index module is used to process the incremental index data from the log analysis module, generate and store the index data. The index data may be access identification index data or / and keyword index data; the access identification index data is index data from access identification to keywords; the keyword index data is index data from keywords to access identification.
[0054] Furthermore, you can choose to distribute the log analysis module and index module on different machines and / or different machine groups, and use t...
Embodiment 2
[0056] Since the log file data from the server module is very large, the data processing workload is correspondingly heavy. Therefore, on the basis of Embodiment 1, as figure 2 As shown, the log analysis module may further include: a log preprocessing module and an incremental index generation module, so as to realize step-by-step processing of data and reduce the workload of single processing. An incremental index transmission module is also set in the log analysis module for sending data to the index module.
[0057] The log preprocessing module is used to process the access log files from the server module to generate query preprocessing data; the incremental index generation module is used to process the query preprocessing data to generate incremental index data; the incremental index transfer module uses to transmit the incremental index data to the index module.
[0058] The log preprocessing module and the incremental index generation module can be set in the same m...
Embodiment 3
[0065] Further, such as Figure 5 As shown, on the basis of the second embodiment, the log analysis module may also include an access identification query string library generation module, which is used to process the incremental index data from the incremental index generation module and store the processed Incremental index data.
[0066] In order to improve the reflection speed of the access identification index system, the access identification index system may only perform relatively simple processing on the access log, such as word segmentation processing, that is, generate an access identification index and save it in the access identification index library. By accessing the ID query string library, the extracted original user request string can be saved for offline natural language processing, such as synonym expansion, syntax analysis of sentence structure, semantic analysis, etc., to analyze and obtain more accurate deep semantic information. Reflect the user's poin...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 