Method of search engine log data mining facing user information requirements

A technology of information demand and search engine, which is applied in electronic digital data processing, special data processing applications, instruments, etc., can solve the problems of neglect, low accuracy, and inability to effectively identify user activity search engine logs, etc., to improve service quality , the effect of promoting development

Active Publication Date: 2013-06-19
ZHEJIANG HONGCHENG COMP SYST
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Compared with manual division, automatic division is simple and fast, but the disadvantage is that the accuracy is not high
[0006] However, the methods mentioned above ignore the fact that when users use search engines, they often perform multiple search activities with information requirements at the same time, which is shown in the search logs as simultaneous search behaviors with multiple search purposes. A complete query activity will be divided into several small pieces and recorded in the search engine log
Traditional methods often divide several small blocks of the same information requirement into multiple search records with different information requirements, which cannot effectively identify such user activity search engine logs with multiple information requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method of search engine log data mining facing user information requirements
  • Method of search engine log data mining facing user information requirements
  • Method of search engine log data mining facing user information requirements

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention proposes a method for mining log data of search engines facing user information requirements, the flow chart is as follows figure 1 As shown, the method can be divided into three stages: query log block classification, query similarity calculation and user information requirement provision.

[0037] Query log block classification:

[0038] The division of user search logs according to user IP and time is consistent with the traditional method, mainly to simplify multi-task division and narrow the scope of user information demand segmentation cycle.

[0039] method such as figure 2 Shown:

[0040] 1) mark the query time and IP of each user query according to the log information;

[0041] 2) For the obtained data, first pairwise adjacent queries (denoted as query Q i and Q i+1 ) user IP for comparison, if the IP is different, the query will be marked as a different block;

[0042] 3) For two adjacent queries with the same IP, it is judged whet...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of internet search engine log division, in particular to a method of search engine log data mining facing user information requirements. The method of the search engine log data mining facing the user information requirements comprises the steps of inquiring log block classification, inquiring similarity calculation and user information requirement provision. Search term similarity and search result similarity are calculated comprehensively to be used as query similarity, two queries are judged whether to have the same information requirements or not according to the query similarity, and division of search logs can be carried out effectively and quickly. The method of the search engine log data mining facing the user information requirements has the advantages that aiming at the defect that a traditional search engine quality evaluation method cannot describe complex and vague information requirements of users completely, a search engine user information requirement satisfaction evaluation method based on behavior logs is provided. User information requirements are used as a unit, user satisfaction is evaluated by analyzing search behaviors of users in search engine logs, personal requirements of the users are analyzed, the development of a search engine technology is promoted, and service quality of a search engine is improved.

Description

technical field [0001] The invention relates to the field of Internet search engine log division, in particular to a search engine log data mining method oriented to user information requirements. Background technique [0002] Search engine log research is an indispensable part of the Internet, especially for website optimization, SEO business needs to be done well, and scientific log analysis must be carried out. The user activity information contained in the search engine logs, such as the user's use time, the location of the clicked document, the number of searches, etc., can provide a basis for user behavior analysis and guide the technical improvement of the search engine. Search engine log division is the basis of search engine log research. At present, there are mainly two methods for dividing search engine logs: manual division and automatic division. The manual division method can be divided into user self-report and evaluator manual annotation. [0003] User self...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 吴勇王敬昌陈岭邵维
Owner ZHEJIANG HONGCHENG COMP SYST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products