Query suggestion method based on query semantics and click-through data

A technology of click stream data and query data, applied in the field of information retrieval, can solve the problem of lack of effective semantic processing of query suggestions, achieve the effect of improving usability and interaction ability, and eliminating query ambiguity

Inactive Publication Date: 2011-11-23
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF3 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to propose a query suggestion method based on query semantics and clickstream data for the lack of effective semantic processing of current query suggestions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Query suggestion method based on query semantics and click-through data
  • Query suggestion method based on query semantics and click-through data
  • Query suggestion method based on query semantics and click-through data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The preferred embodiments of the present invention will be specifically described below in conjunction with the accompanying drawings.

[0026] This embodiment specifically implements the query suggestion method based on query semantics and clickstream data in the present invention, and its process is as follows figure 1 shown, including the following steps:

[0027] 1. Preprocess the collected query log data, remove non-Chinese query strings, garbled data and meaningless symbols, and form a standardized query log library;

[0028] 2. Carry out preprocessing of word segmentation and filter stop words on the query data input by the user, and form a query data string containing multiple keywords;

[0029] 3. Calculate the similarity between the user query data string and the log information in the query log library one by one;

[0030] A variety of methods can be used for similarity calculation, such as cosine similarity calculation, Pearson coefficient similarity calcu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a query suggestion method based on query semantics and click-through data, which comprises the following steps of: 1, preprocessing collected query log data; 2, preprocessing participles and filtering stop words of query data input by a user; 3, calculating similarity of log information in a user query data string and a query log library one by one; 4, calculating semantic relativity of the log information in the user query data string and the query log library one by one on the basis of a word concept relevancy calculation method in the HowNet; 5, fusing the similarity and the semantic relativity, and calculating query semantic relativity of each piece of log information in the user query data string and the query log library; and 6, taking Top-N out and recommending to the user according to a descending relativity sequence in the step 5. By the method, query ambiguity can be effectively eliminated, an input error can be reminded, and usability and interactivity of an information retrieval system are improved.

Description

technical field [0001] The invention relates to a new query suggestion method—QSQSCD (Query Suggestion Based on the Query Semantics and Click-through Data), a query suggestion method based on query semantics and click-through data, which belongs to the field of information retrieval. Background technique [0002] At present, the main interaction mode adopted by search engines is that users input queries independently, and the search system provides retrieval results according to the queries input by users. However, in many cases, the query words entered by users cannot accurately express their search needs. On the one hand, the query words entered by users are usually relatively short—only two or three words on average; on the other hand, many search engines contain ambiguity or vague intentions; in addition, many times, the reason why users use search engines to search for information is Because they have little or no idea about the topic to be retrieved, it is difficult f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 彭学平牛振东黄胜
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products