Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Microblog query expansion method based on multiple layers

A query expansion, multi-level technology, applied in the field of Internet information search, can solve problems such as damage to the accuracy of retrieval results, reduce retrieval efficiency, query drift, etc., to alleviate the mismatch problem, reduce query drift, and refine query expansion. Effect

Active Publication Date: 2015-09-16
EAST CHINA NORMAL UNIVERSITY
View PDF1 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the query expansion of the existing technology brings a large number of words irrelevant to the original query, which not only reduces the retrieval efficiency, but also causes query drift and damages the accuracy of the retrieval results. Words are effectively integrated to achieve the optimal expansion effect, so that the query results can meet the real information needs of users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog query expansion method based on multiple layers

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] See attached figure 1 , the present invention extracts keywords as candidate query expansion words in the corresponding corpus PRF layer and the web layer of the external source of the original microblog query words, and uses the candidate query expansion words and the original microblog query words as the label set to pair in the PRF layer Labeled documents, using Labeled LDA to carry out semantic modeling on labeled PRF documents, and then map candidate query expansion words and original Weibo query words from different sources to a unified semantic layer to mine their potential semantics, and according to Semantic similarity between them, filter out candidate expansion words that have nothing to do with the semantics of the original Weibo query words, add them as query expansion words to the original Weibo query words to form new Weibo query words, and use the expanded new Weibo query words Words are used to query, and the query results can better meet the real infor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a microblog query expansion method based on multiple layers. The microblog query expansion method based on the multiple layers is characterized in that keywords are extracted from a PRF (Pseudo Relevance Feedback) layer of a corpus corresponding to original microblog query words and a web layer of an external source to serve as candidate query expansion words, the candidate query expansion words and original microblog query sentences are merged as a label set for labeling documents in the PRF layer, moreover, Labeled LDA is utilized to semantically model for the labeled PRF documents, the candidate query expansion words and the microblog query words coming from the different sources are then mapped to a unified semantic layer, the potential semantics of the candidate query expansion words and the microblog query words are mined, and according to the semantic similarity between the candidate query expansion words and the microblog query words, the candidate query expansion words which are irrelevant to the semantics of the microblog query words are filtered out, so that a new microblog query word is formed for more accurate query and retrieval. Compared with the prior art, the microblog query expansion method based on the multiple layers has the advantages of less query drifts, high retrieval efficiency and high accuracy, and in particular, the microblog query expansion method based on the multiple layers effectively integrates expansion words to achieve an optimal expansion effect, so that query results can meet the real information requirement of users.

Description

technical field [0001] The invention relates to the technical field of Internet information search, in particular to a multi-level microblog query expansion method. Background technique [0002] With the rise of social networks, Weibo has become an important platform for people to share real-time information. Faced with a large number of microblogs published every day involving various aspects, users usually use the method of retrieval if they want to find the content they are interested in. However, on the one hand, the query words entered by the user are few and not accurate enough; on the other hand, the microblog itself has a character limit and the text is short, so the query results often do not meet the real information needs of the user. In order to solve this problem, the user query is usually expanded during the retrieval process. The expansion of user query is mainly divided into two categories: the expansion based on the query corpus itself and the expansion bas...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/951G06F40/284G06F40/30
Inventor 胡琴敏陈琴贺樑
Owner EAST CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products