Method and system for identifying multiple query intents

A technology of query intent and identification method, applied in the field of information retrieval, can solve the problems of inability to identify user search intent well, lack of user search intent, and inability to accurately distinguish between different intents in multi-intent queries

Active Publication Date: 2013-08-07
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Clustering that comprehensively considers the similarity of content, clicked URL links, and session information includes query clustering proposed by Wen et al., which comprehensively considers the similarity of query content, clicked URL links, and session information similarity. This clustering method only uses a simple weighted form to comprehensively calculate the similarity of querying different information, an

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for identifying multiple query intents
  • Method and system for identifying multiple query intents
  • Method and system for identifying multiple query intents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention will be described below in conjunction with the accompanying drawings and specific embodiments.

[0033] figure 1 An embodiment of the query multi-intent recognition method is described, including the following steps:

[0034] Step 100, preprocessing the data.

[0035] In an embodiment, the data used for query multi-intent recognition may include query text obtained from query logs, user click information, session (session) information and other data. By preprocessing these data, the summary text of the query can be obtained, as well as the number of co-occurrences (the number of co-occurrences) of different queries in the same link or session.

[0036] In one embodiment, the summary text of the query can be obtained from:

[0037] a) In the query log, the query text content itself;

[0038] b) In the query log, information such as the title of the link clicked by the user when searching for the query, and the text summary in the link;

[0039] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a system for identifying multiple query intents. The method includes the steps: calculating probability feature vectors of the query intents according to a G-PLSI model; and calculating similarity between the probability feature vectors of the query intents of different queries, and clustering the queries according to the similarity. The G-PLSI model is used for simulating the generating process of a summary text, searching link clicking behaviors of the different queries with the same query intent and searching behaviors of the different queries with the same query intent in the same session. The probability feature vectors of the query intents reflect summary text information, clicking probability of the different queries on the same link and co-occurrence probability of the different queries in the same session. By the aid of the probability feature vectors of the query intents, search intents of a user can be more accurately reflected by comprehensively using query contents and clicking behaviors of the user.

Description

technical field [0001] The invention relates to the field of information retrieval, in particular to a query multi-intent recognition method and system. Background technique [0002] In the modern age where the amount of information continues to grow rapidly, search engines have become one of the main ways for people to obtain knowledge and useful information. According to the query log information statistics of search engines, the average query length is 2.21 words, of which about 62% are 1 or 2 words in length, and less than 4% are longer than 6 words in query length. Due to the short length of most queries, the search intent expressed by the user in the query often has ambiguity or multiple needs. For example, when the user searches for the word "apple", it may refer to the fruit or the apple. Company, and possibly products of Apple Inc. In addition, because users lack professional knowledge in some fields, it is difficult to use search words to express their meaning cl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 程学旗熊锦华程舒杨廖华明王元卓公帅
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products