Pseudo-relevance feedback query expansion method based on a question-answering system

A pseudo-relevance feedback and query expansion technology, applied in biological neural network models, semantic analysis, instruments, etc. It addresses the inability to identify user search intent, poor model generalization, and the failure to fully exploit feature-crossing information.

Active Publication Date: 2021-02-02
SHANGHAI JIAO TONG UNIV

AI Technical Summary

Problems solved by technology

Its shortcoming is that it uses only context-dependent word embedding vectors and completely ignores the semantic interaction between the query and the document. Ignoring features such as word frequency may also cause semantic drift in the expanded query.
The problem with this method is that the influence of statistical features is computed by fixed rules, so the model generalizes poorly. It uses only the embedding vectors of terms and has no effective means of mining the semantic connection between the query and the pseudo-relevant documents.
[0012] An analysis of relevant domestic and foreign patents and related research leads to the following conclusions. Current pseudo-relevance feedback algorithms in information retrieval cannot effectively use the semantic interaction information between user queries and pseudo-relevant documents, so they fail to identify the user's search intent and introduce bias into query expansion.
Their use of statistical information such as word frequency remains tied to fixed rule models, which generalize poorly and cannot fully exploit feature-crossing information.
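
To make the criticized "fixed rule" approach concrete, here is a minimal sketch of a rule-based pseudo-relevance feedback expansion that ranks candidate terms by a hand-written tf * idf formula. It illustrates the kind of baseline described above, not the patented method; the function name `expand_query_tfidf`, the smoothing in the idf term, and the toy documents are assumptions made for illustration.

```python
import math
from collections import Counter


def expand_query_tfidf(query_terms, feedback_docs, collection_docs, k=2):
    """Append the k best expansion terms, scored by a fixed tf * idf rule.

    feedback_docs are the top-ranked (pseudo-relevant) documents;
    collection_docs is the whole collection. Each document is a list of tokens.
    """
    n_docs = len(collection_docs)
    # Document frequency of each term over the whole collection.
    df = Counter(term for doc in collection_docs for term in set(doc))
    # Term frequency summed over the pseudo-relevant documents only.
    tf = Counter(term for doc in feedback_docs for term in doc)
    # Hand-written scoring rule (smoothed idf is an illustrative choice).
    scores = {
        term: count * math.log((n_docs + 1) / (df[term] + 1))
        for term, count in tf.items()
        if term not in query_terms
    }
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return list(query_terms) + ranked[:k]


docs = [
    ["neural", "ranking", "model"],
    ["neural", "query", "expansion"],
    ["cooking", "pasta", "recipe"],
    ["query", "expansion", "feedback"],
]
expanded = expand_query_tfidf(["query"], [docs[1], docs[3]], docs, k=2)
```

Because the score is a fixed formula, nothing in it adapts to the query's meaning, which is the generalization limit the text points to.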


Examples


Embodiment Construction

[0074] The following describes the preferred embodiments of the present application with reference to the accompanying drawings, to make the technical content clearer and easier to understand. The present application can be embodied in many different forms, and its scope of protection is not limited to the embodiments mentioned herein.

[0075] The idea, specific structure and technical effects of the present invention are further described below so that its purpose, features and effects can be fully understood, but the protection of the present invention is not limited thereto.

[0076] An embodiment of the invention:

[0077] As shown in Figure 1, the retrieval query process includes:

[0078] Step 1. Initial retrieval

[0079] Using a relevance retrieval model, the document set D is retrieved for the first time according to the keywords of the query Q.

[0080] First, the original query and document set wil...
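
The initial retrieval in Step 1 can be sketched as follows. The source does not say which relevance retrieval model is used, so this example assumes BM25 purely for illustration; the function name `bm25_rank`, the parameter defaults `k1=1.2` and `b=0.75`, and the toy document set are likewise assumptions. It expects a non-empty document set of token lists.

```python
import math
from collections import Counter


def bm25_rank(query_terms, docs, k1=1.2, b=0.75):
    """Return document indices ordered by BM25 score for the query (best first)."""
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Number of documents that contain each term.
    df = Counter(t for d in docs for t in set(d))
    scored = []
    for idx, doc in enumerate(docs):
        tf = Counter(doc)
        score = 0.0
        for term in query_terms:
            if tf[term] == 0:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Length-normalised term-frequency saturation.
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scored.append((score, idx))
    # Highest score first; ties keep the original document order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored]


D = [
    ["pasta", "recipe"],
    ["query", "expansion", "feedback"],
    ["neural", "ranking"],
]
ranking = bm25_rank(["query", "expansion"], D)
```

In a pseudo-relevance feedback pipeline, the first few entries of `ranking` would be treated as the pseudo-relevant documents from which expansion terms are drawn.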


Abstract

The invention discloses a pseudo-relevance feedback query expansion method based on a question-answering system. The method borrows mature semantic mining modules from question-answering systems, such as attention mechanisms, so that the model can better understand the user's search intent and select expansion terms according to the interactive semantic information between the query and the documents. Compared with a traditional model, adding semantic interaction features markedly improves the selection of expansion terms. In addition, a neural network trained with a pairwise loss function is added to learn the statistical characteristics of terms; term frequency, inverse document frequency and similar features are used to correct the semantic drift that the semantic model may introduce. Practice shows that, compared with a traditional pseudo-relevance feedback algorithm, the method achieves higher ranking accuracy and better robustness, and can be applied to various search scenarios.
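
As a rough illustration of the pairwise loss idea mentioned in the abstract, the sketch below trains a scorer so that a good expansion term outscores a bad one by a margin. The abstract does not specify the network or the exact loss, so everything here is an assumption: a linear scorer stands in for the neural network, a hinge loss stands in for the pairwise loss, and the two features per term are taken to be term frequency and inverse document frequency.

```python
def score(weights, features):
    """Linear stand-in for the scoring network: dot product of weights and features."""
    return sum(w * x for w, x in zip(weights, features))


def pairwise_hinge_loss(weights, good, bad, margin=1.0):
    """Zero once the good term outscores the bad term by at least the margin."""
    return max(0.0, margin - (score(weights, good) - score(weights, bad)))


def sgd_step(weights, good, bad, lr=0.1, margin=1.0):
    """One gradient step on the pairwise hinge loss."""
    if pairwise_hinge_loss(weights, good, bad, margin) == 0.0:
        return list(weights)  # margin already satisfied, no update
    # When the margin is violated, the gradient w.r.t. the weights is (bad - good).
    return [w - lr * (b - g) for w, g, b in zip(weights, good, bad)]


# Hypothetical [term frequency, inverse document frequency] features.
good_term = [2.0, 1.0]
bad_term = [1.0, 0.5]
w0 = [0.0, 0.0]
loss_before = pairwise_hinge_loss(w0, good_term, bad_term)
w1 = sgd_step(w0, good_term, bad_term)
loss_after = pairwise_hinge_loss(w1, good_term, bad_term)
```

Training on pairs rather than on absolute labels only requires knowing which of two candidate terms is better, which is one common reason to choose a pairwise objective for ranking.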

Description

Technical field

[0001] The invention relates to the field of information retrieval methods, in particular to research on a query expansion method based on a pseudo-relevance feedback algorithm in search engines.

Background

[0002] Today, with the development of information technology, more and more people use search engines to search, browse and query relevant knowledge. Search engines use specific strategies to retrieve customized information from the Internet and return it to users based on user needs and related algorithms. However, due to the diversification of the Internet ecosystem and the rapid growth in the amount of information, it is difficult for users to formulate the required queries accurately and efficiently, so users may provide only short queries or a few query terms. This may leave the search engine unable to fully capture the user's query intent, thus making it impossible to return th...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/332, G06F40/194, G06F40/216, G06F40/30, G06K9/62, G06N3/04
CPC: G06F16/3329, G06F40/30, G06F40/216, G06F40/194, G06F2216/03, G06N3/045, G06F18/214
Inventor: 侯嘉伟, 张伟楠
Owner: SHANGHAI JIAO TONG UNIV