Query representation and hybrid retrieval model construction method based on context sensing theme

A technology of model building and topic model, applied in the field of Internet information retrieval, can solve the problems of deviating from the original query, reducing query accuracy, and only considering, so as to achieve the effect of reducing query drift, promoting the improvement of retrieval effect, and reducing the introduction of noise

Inactive Publication Date: 2017-01-04
EAST CHINA NORMAL UNIVERSITY
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing extended word selection methods generally only consider the co-occurrence of the extended word and the original query word in the context window of the pseudo-relevance feedback, and there are still the following problems: (1) It is necessary to explicitly select which words to use as the final query Extension, some irrelevant words, even "harmful words" will still be introduced in the unsupervised situation
For example, in articles involving various environmental resources, the keyword "water shortage" appe

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Query representation and hybrid retrieval model construction method based on context sensing theme
  • Query representation and hybrid retrieval model construction method based on context sensing theme
  • Query representation and hybrid retrieval model construction method based on context sensing theme

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0021] The present invention will be further described in detail with reference to the following specific embodiments and accompanying drawings. Except for the content specifically mentioned below, the process, conditions, experimental methods, etc. for implementing the present invention are all common knowledge and common knowledge in the field, and the present invention is not particularly limited.

[0022] like figure 1 As shown, the method for establishing a query representation and a hybrid retrieval model based on a context-aware topic of the present invention includes the following steps:

[0023] Step 1: based on the keyword set of the query, obtain a pseudo-relevant feedback document of the query, and select a context related to the query from the pseudo-relevant feedback document;

[0024] Step 2: Introduce a context-aware topic model, integrate the context into the context-aware topic model, mine the topic information implied by the context window based on the corp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a query representation and hybrid retrieval model construction method based on a context sensing theme. The method includes the following steps that firstly, a pseudo relevance feedback document of query is obtained on the basis of a keyword set of query, and context related to query is selected from the pseudo relevance feedback document; secondly, a context sensing theme model is introduced, the context is fused into the context sensing theme model, and implicit theme information of a context window is mined on the basis of a corpus theme to obtain a corresponding theme vector; thirdly, query is represented by combining the theme vector and the keyword set, a hybrid retrieval model is constructed on the basis of the theme vector and the keyword set, and a final retrieval score is obtained.

Description

technical field [0001] The invention relates to the technical field of Internet information retrieval, in particular to a method for establishing a query representation and a hybrid retrieval model based on a context-aware topic model. Background technique [0002] Query representation has always been the core of the field of information retrieval, and the most common problem is that the user query is too short (contains only a few keywords), and it is easy to cause the relevant documents in the retrieval process to not match the query. For example, for the user query "short of water", if the document contains words related to the query such as "drought", although the correlation is high, but because the original query keyword "short of water" is not included, the final matching degree will be very low. low, thereby affecting the accuracy of the query. [0003] A common solution is query expansion based on pseudo-relevance feedback. This method is based on the preliminary ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/3344G06F16/3331
Inventor 贺樑陈琴胡琴敏
Owner EAST CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products