Microblog query expansion method based on multiple layers

A query expansion, multi-level technology, applied in the field of Internet information search, can solve problems such as damage to the accuracy of retrieval results, reduce retrieval efficiency, query drift, etc., to alleviate the mismatch problem, reduce query drift, and refine query expansion. Effect

Active Publication Date: 2015-09-16
EAST CHINA NORMAL UNIVERSITY
View PDF1 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the query expansion of the existing technology brings a large number of words irrelevant to the original query, which not only reduces the retrieval efficiency, but also causes query d

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog query expansion method based on multiple layers

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0020] See attached figure 1 In the present invention, the original Weibo query term is extracted from the corresponding corpus PRF layer and the external source web layer as the candidate query expansion term, and the candidate query expansion term and the original Weibo query term are used as the tag set to the PRF layer Labeled documents, use Labeled LDA to perform semantic modeling on the labeled PRF documents, and then map candidate query expansion words and original Weibo query words from different sources to a unified semantic layer, and dig out their potential semantics, and based on The semantic similarity between them filters out candidate expansion words that have nothing to do with the meaning of the original Weibo query words, and add them as query expansion words to the original Weibo query words to form new Weibo query words, and use the expanded new Weibo query The query results can be more in line with the user’s real information needs. The specific expansion o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a microblog query expansion method based on multiple layers. The microblog query expansion method based on the multiple layers is characterized in that keywords are extracted from a PRF (Pseudo Relevance Feedback) layer of a corpus corresponding to original microblog query words and a web layer of an external source to serve as candidate query expansion words, the candidate query expansion words and original microblog query sentences are merged as a label set for labeling documents in the PRF layer, moreover, Labeled LDA is utilized to semantically model for the labeled PRF documents, the candidate query expansion words and the microblog query words coming from the different sources are then mapped to a unified semantic layer, the potential semantics of the candidate query expansion words and the microblog query words are mined, and according to the semantic similarity between the candidate query expansion words and the microblog query words, the candidate query expansion words which are irrelevant to the semantics of the microblog query words are filtered out, so that a new microblog query word is formed for more accurate query and retrieval. Compared with the prior art, the microblog query expansion method based on the multiple layers has the advantages of less query drifts, high retrieval efficiency and high accuracy, and in particular, the microblog query expansion method based on the multiple layers effectively integrates expansion words to achieve an optimal expansion effect, so that query results can meet the real information requirement of users.

Description

technical field [0001] The invention relates to the technical field of Internet information search, in particular to a multi-level microblog query expansion method. Background technique [0002] With the rise of social networks, Weibo has become an important platform for people to share real-time information. Faced with a large number of microblogs published every day involving various aspects, users usually use the method of retrieval if they want to find the content they are interested in. However, on the one hand, the query words entered by the user are few and not accurate enough; on the other hand, the microblog itself has a character limit and the text is short, so the query results often do not meet the real information needs of the user. In order to solve this problem, the user query is usually expanded during the retrieval process. The expansion of user query is mainly divided into two categories: the expansion based on the query corpus itself and the expansion bas...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/951G06F40/284G06F40/30
Inventor 胡琴敏陈琴贺樑
Owner EAST CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products