Accessing identification index system and accessing identification index library generation method

An access identification index system, applied in the field of access identification indexing, that addresses the limited accuracy with which websites understand user behavior and enables more satisfactory service.

Active Publication Date: 2008-12-10
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

Since the user profile is summary information, a large amount of specific information in the user's original behavior records may not be reflected in it. This limits how accurately the website can understand user behavior and user needs, and makes it difficult for the website to provide users with more specific and more efficient service.

Examples

Embodiment 1

[0052] As shown in Figure 1, the system includes a server module connected to the network, a log analysis module, and an index module. The server module can be implemented with a standard open-source module such as an Apache module, or with another server module.

[0053] The log analysis module processes the access log files from the server module, generates incremental index data, and transmits the incremental index data to the index module. The index module processes the incremental index data from the log analysis module, then generates and stores the index data. The index data may be access identification index data and/or keyword index data. Access identification index data maps access identifiers to keywords; keyword index data maps keywords to access identifiers.
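
The two index directions can be illustrated with a minimal Python sketch. This is not the patent's implementation: the class name `IndexModule`, the method `merge_increment`, and the shape of the incremental data (a list of access ID and keyword-list pairs) are assumptions made only for illustration.

```python
from collections import defaultdict


class IndexModule:
    """Hypothetical index module holding both index directions."""

    def __init__(self):
        # Access identification index: access ID -> set of keywords.
        self.access_id_index = defaultdict(set)
        # Keyword index: keyword -> set of access IDs.
        self.keyword_index = defaultdict(set)

    def merge_increment(self, incremental_data):
        """Merge a batch of (access_id, keywords) pairs into both indexes."""
        for access_id, keywords in incremental_data:
            for keyword in keywords:
                self.access_id_index[access_id].add(keyword)
                self.keyword_index[keyword].add(access_id)


# Assumed shape of one incremental batch from the log analysis module.
index = IndexModule()
index.merge_increment([("id-001", ["patent", "search"]), ("id-002", ["search"])])
print(sorted(index.access_id_index["id-001"]))
print(sorted(index.keyword_index["search"]))
```

Keeping both mappings lets the site look up what a given visitor searched for as well as which visitors searched for a given keyword.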

[0054] Furthermore, the log analysis module and the index module may be distributed on different machines and/or different machine groups, and use t...

Embodiment 2

[0056] Since the log file data from the server module is very large, the data processing workload is correspondingly heavy. Therefore, on the basis of Embodiment 1 and as shown in Figure 2, the log analysis module may further include a log preprocessing module and an incremental index generation module, so that the data is processed step by step and the workload of each processing step is reduced. An incremental index transmission module is also provided in the log analysis module for sending data to the index module.

[0057] The log preprocessing module processes the access log files from the server module to generate query preprocessing data; the incremental index generation module processes the query preprocessing data to generate incremental index data; the incremental index transmission module transmits the incremental index data to the index module.
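
The three stages described above can be sketched as three small Python functions. The log line format (tab-separated timestamp, access ID, and query), the function names, and the whitespace-based keyword split are all assumptions for illustration; the source does not specify them.

```python
def preprocess_log(lines):
    """Log preprocessing: turn raw log lines into (access_id, query) records.

    Assumes a hypothetical tab-separated format: timestamp, access ID, query.
    """
    records = []
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) != 3:
            continue  # skip lines that do not match the assumed format
        _timestamp, access_id, query = parts
        if access_id and query:
            records.append((access_id, query))
    return records


def generate_incremental_index(records):
    """Incremental index generation: split each query into lowercase keywords."""
    return [(access_id, query.lower().split()) for access_id, query in records]


def transmit(incremental_data, index_store):
    """Incremental index transmission: merge the batch into the index store."""
    for access_id, keywords in incremental_data:
        index_store.setdefault(access_id, set()).update(keywords)


log_lines = [
    "2008-01-01T10:00:00\tid-001\tPatent Search\n",
    "malformed line\n",
    "2008-01-01T10:00:05\tid-002\tindex library\n",
]
index_store = {}
transmit(generate_incremental_index(preprocess_log(log_lines)), index_store)
print(index_store)
```

Because each stage only consumes the output of the previous one, the stages could run on separate machines, with `transmit` replaced by a network call.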

[0058] The log preprocessing module and the incremental index generation module can be set in the same m...

Embodiment 3

[0065] Further, as shown in Figure 5 and on the basis of Embodiment 2, the log analysis module may also include an access identification query string library generation module, which processes the incremental index data from the incremental index generation module and stores the processed incremental index data.

[0066] To improve the response speed of the access identification index system, the system may perform only relatively simple processing on the access log, such as word segmentation, to generate an access identification index and save it in the access identification index library. The access identification query string library saves the extracted original user request strings for offline natural language processing, such as synonym expansion, syntactic analysis of sentence structure, and semantic analysis, so that more accurate deep semantic information can be obtained. Reflect the user's poin...
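
The split between a fast online path and a slower offline path might look like the following Python sketch. The names `QueryStringLibrary`, `handle_request`, and `offline_expand` are hypothetical, whitespace splitting stands in for real word segmentation, and the synonym table is an invented example rather than data from the source.

```python
from collections import defaultdict


class QueryStringLibrary:
    """Hypothetical store of original request strings, keyed by access ID."""

    def __init__(self):
        self._queries = {}

    def save(self, access_id, raw_query):
        self._queries.setdefault(access_id, []).append(raw_query)

    def queries_for(self, access_id):
        return list(self._queries.get(access_id, []))


def handle_request(access_id, raw_query, access_id_index, library):
    """Online path: cheap segmentation only, plus saving the raw string."""
    access_id_index[access_id].update(raw_query.split())
    library.save(access_id, raw_query)


def offline_expand(library, access_id, synonyms):
    """Offline path: expand the saved queries with an assumed synonym table."""
    expanded = set()
    for raw_query in library.queries_for(access_id):
        for word in raw_query.split():
            expanded.add(word)
            expanded.update(synonyms.get(word, ()))
    return expanded


access_id_index = defaultdict(set)
library = QueryStringLibrary()
handle_request("id-001", "cheap laptop", access_id_index, library)
expanded = offline_expand(library, "id-001", {"laptop": ["notebook"]})
print(sorted(access_id_index["id-001"]))
print(sorted(expanded))
```

Saving the raw string is what makes the later analysis possible: the online index keeps only segmented keywords, while the library preserves the full request for deeper processing.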

Abstract

The access identification index system comprises a server module, a log analysis module, and an index module. The log analysis module processes log files from the server module, generates incremental index data, and sends it to the index module, which processes that data to generate and store the index data. The invention lets a website study user behavior in depth, learn user needs, and provide personalized service.

Description

Technical field

[0001] The invention relates to an access identification index system and to a method for generating an access identification index library based on that system.

Background art

[0002] On the Internet, when a user visits a web site, the web site generates an access identifier for the user to record that the user has visited the site.

[0003] In the prior art, user identification is implemented through cookie technology. A cookie is a piece of text that a web server saves on the user's hard drive. It allows a web site to save information on the user's machine and retrieve it later. A web site generates a unique ID for each visitor and saves it on the user's machine in the form of a cookie file, so that the site can remember the last state the browser was in. A user ID is simply state information: if the ID exists on t...
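
The cookie-based identification described in the background can be sketched with Python's standard `http.cookies` and `uuid` modules. The cookie name `access_id` and the helper `get_or_assign_access_id` are assumptions for illustration; the source does not say how the ID is generated or named.

```python
import uuid
from http.cookies import SimpleCookie

COOKIE_NAME = "access_id"  # hypothetical cookie name, not from the source


def get_or_assign_access_id(cookie_header):
    """Return (access_id, set_cookie_value); the second item is None
    when the visitor already carries an ID."""
    cookie = SimpleCookie()
    cookie.load(cookie_header or "")
    if COOKIE_NAME in cookie:
        # Returning visitor: reuse the ID stored on the user's machine.
        return cookie[COOKIE_NAME].value, None
    # New visitor: generate a unique ID and ask the browser to store it.
    access_id = uuid.uuid4().hex
    response = SimpleCookie()
    response[COOKIE_NAME] = access_id
    response[COOKIE_NAME]["path"] = "/"
    return access_id, response[COOKIE_NAME].OutputString()


new_id, set_cookie = get_or_assign_access_id("")
known_id, no_cookie = get_or_assign_access_id("access_id=abc123")
print(set_cookie)
print(known_id, no_cookie)
```

A server would send the returned string in a `Set-Cookie` response header for new visitors and write the access ID into its access log on every request.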

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 李彦宏, 朱洪波, 刘建国, 郭眈, 周利民, 王湛, 刘子正, 袁杰, 王闯, 杨文凯
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD