Query expansion method based on multisource positive and negative external feedback information

A technology of external feedback and query expansion, applied in special data processing applications, instruments, network data retrieval, etc., can solve the problems of small scale, increase the risk of query expansion, and cannot well reflect the real query intention of users, so as to reduce The effect of expanding risk and improving performance

Active Publication Date: 2017-07-18
BEIJING UNIV OF TECH
View PDF2 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The first method mostly uses methods such as text clustering, latent semantic indexing (LSI for short) and similarity dictionaries to expand the query. However, due to the relatively fixed composition of the local corpus and the small scale, it cannot well reflect the real user experience. query intent
The s

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Query expansion method based on multisource positive and negative external feedback information
  • Query expansion method based on multisource positive and negative external feedback information
  • Query expansion method based on multisource positive and negative external feedback information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] Such as figure 1 As shown, the embodiment of the present invention provides a query expansion method based on multi-source positive and negative external feedback information, including the following steps:

[0041] Step (1) Get Tweets blog posts

[0042] Obtain Tweets blog post, which consists of two parts: number and text

[0043] Step (2) Obtain user interest words

[0044] The user interest file consists of three parts: number, query word, and interest description, from which the user query word is parsed as the user interest word.

[0045] Step (3) Tweets preprocessing

[0046] Step (3.1) filters non-English Tweets and Tweets that are less than two words in length.

[0047] Step (3.2) removes punctuation marks, numbers, and URLs in Tweets, and converts all letters to lowercase;

[0048] Step (3.3) segment Tweets based on simple spaces and remove stop words. Different word forms in English are regarded as different words, for example, "organ" and "organs" are r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a query expansion method based on multisource positive and negative external feedback information. An expansion risk is reduced by introducing a regular constraint into a processing of fusing external query information; therefore, new query can be rapidly and effectively built, and thus a search result more conforms to user needs. Compared with a traditional feedback search method, the technical scheme of the method provided by the invention has the effect of significantly enhanced performance.

Description

technical field [0001] The invention belongs to the field of word information processing, in particular to a query expansion method based on multi-source positive and negative external feedback information. Background technique [0002] The emergence of social media (such as Twitter, Facebook, Google+) has profoundly changed the way people produce and consume information. The biggest difference between him and mainstream news media websites (such as CNN or nytimes) is that people in social networks are information Consumers are also producers of information. Since the sources of information in social networks are not only diverse but also chaotic, this increases the difficulty for users to obtain information. [0003] The traditional method query expansion methods are mainly divided into two types according to the different expansion sources: 1) local query expansion methods using local query document sets as the expansion source; 2) global expansion methods using external k...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/3326G06F16/3344G06F16/951
Inventor 杨震李超阳
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products