Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Query representation and hybrid retrieval model construction method based on context sensing theme

A technology of model building and topic model, applied in the field of Internet information retrieval, can solve the problems of deviating from the original query, reducing query accuracy, and only considering, so as to achieve the effect of reducing query drift, promoting the improvement of retrieval effect, and reducing the introduction of noise

Inactive Publication Date: 2017-01-04
EAST CHINA NORMAL UNIVERSITY
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing extended word selection methods generally only consider the co-occurrence of the extended word and the original query word in the context window of the pseudo-relevance feedback, and there are still the following problems: (1) It is necessary to explicitly select which words to use as the final query Extension, some irrelevant words, even "harmful words" will still be introduced in the unsupervised situation
For example, in articles involving various environmental resources, the keyword "water shortage" appears frequently, but similar words such as "hydroelectric power" and "natural gas" also appear in its context, which will deviate from the original query and reduce the accuracy of the query. (2) The final query representation is still based on the dictionary space, ignoring the semantic information implicit in the query, such as potential topics; (3) The retrieval model based on this query representation mainly considers keyword matching, but ignores the relationship between documents and queries. Matching at the Semantic Level

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Query representation and hybrid retrieval model construction method based on context sensing theme
  • Query representation and hybrid retrieval model construction method based on context sensing theme
  • Query representation and hybrid retrieval model construction method based on context sensing theme

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The present invention will be further described in detail in conjunction with the following specific embodiments and accompanying drawings. The process, conditions, experimental methods, etc. for implementing the present invention, except for the content specifically mentioned below, are common knowledge and common knowledge in this field, and the present invention has no special limitation content.

[0022] Such as figure 1 As shown, the query representation and hybrid retrieval model building method based on context-aware topics of the present invention includes the following steps:

[0023] Step 1: Based on the keyword set of the query, obtain the pseudo-relevant feedback document of the query, and select the context related to the query from the pseudo-related feedback document;

[0024] Step 2: Introduce the context-aware topic model, integrate the context into the context-aware topic model, mine the topic information hidden in the context window based on the corp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a query representation and hybrid retrieval model construction method based on a context sensing theme. The method includes the following steps that firstly, a pseudo relevance feedback document of query is obtained on the basis of a keyword set of query, and context related to query is selected from the pseudo relevance feedback document; secondly, a context sensing theme model is introduced, the context is fused into the context sensing theme model, and implicit theme information of a context window is mined on the basis of a corpus theme to obtain a corresponding theme vector; thirdly, query is represented by combining the theme vector and the keyword set, a hybrid retrieval model is constructed on the basis of the theme vector and the keyword set, and a final retrieval score is obtained.

Description

technical field [0001] The invention relates to the technical field of Internet information retrieval, in particular to a method for establishing a query representation and a hybrid retrieval model based on a context-aware topic model. Background technique [0002] Query representation has always been the core of the field of information retrieval, and the most common problem is that the user query is too short (contains only a few keywords), and it is easy to cause the relevant documents in the retrieval process to not match the query. For example, for the user query "short of water", if the document contains words related to the query such as "drought", although the correlation is high, but because the original query keyword "short of water" is not included, the final matching degree will be very low. low, thereby affecting the accuracy of the query. [0003] A common solution is query expansion based on pseudo-relevance feedback. This method is based on the preliminary ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/3344G06F16/3331
Inventor 贺樑陈琴胡琴敏
Owner EAST CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products