Unlock instant, AI-driven research and patent intelligence for your innovation.

Search result diversification ordering method based on hierarchical structure subtopic

A sorting method and technology of search results, applied in the direction of network data retrieval, network data indexing, and other database retrieval, etc., can solve the problems of difficult sub-topics and difficult matching of user intentions

Inactive Publication Date: 2016-04-13
RENMIN UNIVERSITY OF CHINA
View PDF1 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the subtopics in the diversification algorithm are automatically generated according to the query, it is difficult to perfectly match the real user intent
However, the current diversification method mainly uses subtopics in the form of a list, and it is difficult to find subtopics with appropriate granularity that can perfectly match the real user intent.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Search result diversification ordering method based on hierarchical structure subtopic
  • Search result diversification ordering method based on hierarchical structure subtopic
  • Search result diversification ordering method based on hierarchical structure subtopic

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present application will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0048] Define the expressions and related concepts of hierarchical subtopics:

[0049] For a given query q, we use R = {d 1 , d 2 ,...,d m} represents the initial set of documents that have not yet been diversified, with T q ={t 1 ,t 2 ,...,t n} represents the set of n subtopics related to the query. Given P(d|q) represents the probability that document d is related to query q, P(d|t) represents the probability that document d is related to subtopic t, and P(t|q) represents the importance of subtopic t in query q degree. At present, most of the diversification algorithms based on subtopics use T q , P(d|q), P(d|t), P(t|q) reorder the initial document R, and obtain a diversified result document, denoted as D. In the hierarchical diversification model, due to the introduction of multi-layer subtopics, the present invention needs to rede...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a search result diversification ordering method based on a hierarchical structure subtopic. The method comprises the following steps: 1) defining an expression method of a hierarchical structure dendric subtopic of a search word; 2) estimating the correlation between the hierarchical structure subtopic and a searched document; and 3) establishing a search result diversification model based on the hierarchical structure subtopic of the search word; wherein step 3) is realized by any of two ordering methods: a) performing diversification ordering on documents according to a hierarchical structure topic novelty model; and b) performing diversification ordering on documents according to a hierarchical structure topic proportional model. The invention defines a searched hierarchical structure subtopic and a calculating method of correlation between multilayer subtopics and searched documents, and proposes a search result diversification algorithm based on the hierarchical structure subtopic, wherein a real user intention can be matched more accurately by flexibly using the subtopics of different granularities, so that the diversification of search results is improved.

Description

technical field [0001] The invention relates to a method for diversifying and sorting search results based on hierarchical structure subtopics. Background technique [0002] Internet information has covered people's daily life more and more comprehensively, and users have gradually become accustomed to relying on search engines to find the information they need. A large number of studies have shown that a considerable part of the queries submitted to search engines are short text queries. Due to the small amount of information, these short text queries are usually ambiguous or have multiple meanings when interpreting user intent. For common ambiguous queries, for example, when searching for "Apple", some users may be looking for information about the famous Apple company, while others are concerned about information about fruits and apples; when searching for "National People's Congress", some users may Maybe they are looking for information about Renmin University of Chin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/951
Inventor 窦志成文继荣胡莎
Owner RENMIN UNIVERSITY OF CHINA