Pseudo-correlation feedback model information retrieval method and system based on semantic similarity

A technology of semantic similarity and pseudo-relevant feedback, applied in digital data information retrieval, special data processing applications, instruments, etc., can solve problems such as inaccuracy and incomplete query input

Active Publication Date: 2019-05-31
HUAZHONG NORMAL UNIV
View PDF3 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In practical problems, users often have incomplete or inaccurate query input

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pseudo-correlation feedback model information retrieval method and system based on semantic similarity
  • Pseudo-correlation feedback model information retrieval method and system based on semantic similarity
  • Pseudo-correlation feedback model information retrieval method and system based on semantic similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The technical solutions of the present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0049] The present invention proposes to score each sentence and the original query Q based on the semantic similarity, and then scan each word. The total score of the word is the sum of the sentence scores of all sentences where the word is located, and the semantic similarity As an additional weight, it is fused into the pseudo-relevance feedback model to achieve query expansion to improve the accuracy of retrieval.

[0050] The embodiment proposes an information retrieval method that integrates semantic similarity into a pseudo-relevance feedback model, including integrating the semantic similarity of a sentence into a pseudo-relevance feedback model to realize information retrieval, including generating query expansion words in a pseudo-relevance document collection At this time, the first N feedback documents of the ini...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a pseudo-correlation feedback model information retrieval method and system based on semantic similarity. The method comprises the following steps: carrying out a first query from a target document set according to a query keyword to extract a pseudo-related document set, carrying out query expansion by adopting a Rochio algorithm, carrying out query expansion according to the semantic similarity of sentences, fusing the results of the two query expansion methods, and carrying out a second query to realize final information retrieval. According to the invention, when theextended lexical item is selected; the importance degree relationship between the query lexical item and the extension word in the traditional method can be highlighted; the semantic correlation of the sentences where the lexical items are located is combined; the condition that lexical items are associated when sentence semantics are similar in reality is met; According to the method and the device, the conditions that the semantics are related even if the lexical items are different are represented, so that the query words have better regional indexing in a multi-semantic environment, a large amount of useless and irrelevant information can be removed from mass information, more accurate candidate words can be obtained, and the precision of expanded query and final retrieval can be improved.

Description

technical field [0001] The invention belongs to the technical field of information retrieval, and in particular relates to an information retrieval method and system for integrating semantic similarity into a pseudo correlation feedback model. Background technique [0002] In the age of information competition, browsing and obtaining desired information with the help of search engines is an important part of people's daily life. However, the extremely rich network resources and the rapid expansion of the total amount of information make it difficult for users to efficiently and accurately obtain and identify important information. Information processing technology urgently needs a more effective theory and method to deal with the growing mass of data. As a classic text processing technology, information retrieval can adapt to this requirement and quickly become a research hotspot in the current information processing research field. [0003] Information Retrieval refers to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/9535G06F17/27
Inventor 何婷婷潘敏王俊美曾俊王雪彦
Owner HUAZHONG NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products