Method and system for recommending query based on user log

A user log, query recommendation technology, applied in the field of search engines, can solve problems such as easy generation of ambiguity, large amount of calculation, short query string, etc.

Active Publication Date: 2012-07-25
PEKING UNIV
View PDF3 Cites 73 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although the technology of search engines is constantly improving and improving, there are still the following problems: On the one hand, statistics show that the query strings input by users are generally short, with an average of only 2-4 Chinese characters. The topic is relatively broad and prone to ambiguity, which may not exactly reflect the user's search intention; on the other hand, even if the keyword proposed by the user is accurate, the search engine will only return the results that match the keyword to the user. It is versatile and cannot well meet the personalized information needs of users
[0007] Many of the traditional query recommendations are based on documents, or use a large amount of document information, or use semantic resources edited by humans, but usually have a large amount of calculation, especially no longer suitable for frequent network content updates, new things emerging one after another, and diversified search intentions Recommended Web Retrieval System

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for recommending query based on user log
  • Method and system for recommending query based on user log
  • Method and system for recommending query based on user log

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0057] This embodiment records a method for query recommendation based on user logs, that is, for a query string q arbitrarily given by a user, from the user (query and click) logs in a certain period of time, find a query with good feedback results and Several query strings {q that are more relevant to the given query string 1 ,q 2 ,..q n} and recommended to the user.

[0058] Such as figure 1 As shown, the method for query recommendation includes the following steps:

[0059] S1: Select a data set in the search engine user log, and preprocess the selected data set to obtain an effective query log set as the first data set;

[0060] The user log refers to the record of the interaction between the user and the system. Usually, it includes information such as the query string submitted to the system by the user when querying, the submission time, the user IP address, the URL clicked by the user, etc. Table 1 shows a record of the system querying the user log main informati...

Embodiment 2

[0116] This embodiment records a system for performing query recommendation based on user logs for implementing the above method, including:

[0117] The data preparation module is used to select the data sets in the search engine user logs, and preprocess the selected data sets to obtain an effective query log set as the first data set; and extract each The support, popularity and recommendation indicators of the query string, select the query string and user records that meet the minimum threshold of these three feature indicators as the second data set;

[0118] The predictive model building module is used to select a plurality of typical query strings as training data, as the first training sample set; for each query string in the first training sample set, extract a query string with a certain co-occurrence degree and similarity in the second data set The query string with the degree of relevance is used as a candidate related query string, and then the correlation with the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and system for recommending query based on user log. The method comprises acquiring an effective query log set according to the data set in the user log; selecting a typical query string as the training set, extracting 6 characteristic indexes of each query string in the effective query log set, such as support degree, popularity, recommendation degree, co-occurrence degree, similarity, and association degree, and constructing a composite prediction model based on the training set; and extracting the 6 characteristic indexes of candidate query strings inputted by a user, inputting the extracted characteristic indexes into the composite prediction model as variables, calculating the relevancy between each candidate query string and a given query string, and outputting n query strings with higher rank. The system comprises a data preparation module, a prediction model construction module, and a processing output model for realizing the above method. By fully utilizing the user log of a search engine, the method and system can recommend query strings with higher quality for the user.

Description

technical field [0001] The invention relates to the technical field of search engines, in particular to a method and system for query recommendation based on user logs. Background technique [0002] Along with the fast growth of information amount on the World Wide Web, more and more people use search engines to find useful information on the Web. According to the statistical report of China Internet Network Information Center (CNNIC) in 2011, the utilization rate of search engines among various network application services has ranked first, and it has become the most important entrance for netizens to enter the Internet. When using a search engine, the user only needs to enter a query string (or query phrase, query) in the search box, and the retrieval system will provide a search result list (result list) according to the content entered by the user, and the user clicks on the URL of the corresponding result to arrive at the search engine. corresponding page. [0003] Al...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 王继民李雷明子王建冬
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products