Microblog sorting model building and microblog diversity retrieval methods

A technology for sorting models and establishing methods, which is applied in the fields of information retrieval and social media retrieval, and can solve problems such as declining effects, brevity, and irregular grammar, so as to improve user experience, improve accuracy and coverage, and reduce information redundancy. Effect

Active Publication Date: 2017-03-08
中国国防科技信息中心
View PDF4 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] As a kind of social media, Weibo has short text and irregular grammar, which makes t...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog sorting model building and microblog diversity retrieval methods
  • Microblog sorting model building and microblog diversity retrieval methods
  • Microblog sorting model building and microblog diversity retrieval methods

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0031] Such as figure 1 As shown, a method for establishing a microblog sorting model, the method includes:

[0032] Step S1) constructs a training data set; the training data set includes a series of query words, several microblogs corresponding to each query word, and the sequence of these microblogs obtained by manual labeling (as a training standard answer);

[0033] Step S2) extracting the attribute of the microblog corresponding to each query word in the training data set;

[0034] Step S3) using the attributes of the microblog corresponding to each query term to extract the correlation feature and similarity feature of each blog post;

[0035] In traditional correlation ranking learning methods, only the correlation between query phrases and retrieved blog posts is considered. The feature between blog posts in the present in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a microblog sorting model building method. The method comprises the steps of S1) constructing a training data set, wherein the training data set comprises a series of query words, each query word corresponds to a plurality of microblogs, and an arrangement sequence of the microblogs is obtained in a manual annotation manner to serve as a training standard answer; S2) extracting attributes of the microblogs corresponding to each query word in the training data set; S3) extracting a relevance feature and a similarity feature of each blog article; and S4) building and training a sorting model. Based on the model, the invention furthermore provides a microblog diversity retrieval method. By use of the method, diversified retrieval results are returned when a user retrieves related information in the microblogs, so that information redundancy is reduced, the accuracy and coverage of a retrieval result of a retrieval system can be effectively improved, and the user experience is enhanced.

Description

technical field [0001] The invention relates to the technical field of information retrieval, in particular to the field of social media retrieval, and in particular to the establishment of a microblog sorting model and a microblog diversity retrieval method. Background technique [0002] Microblog retrieval belongs to the field of information retrieval and is an important means to extract effective information from massive microblog data. In microblog retrieval, users generally express their query intentions by entering shorter query words (average 1.64 words), which often lead to ambiguity or uncertainty in user query intentions. For example, when a user enters the query word "apple", the retrieval system needs to determine whether the user's query intention needs information related to the Apple company or information related to fruit such as apples. [0003] However, the current microblog retrieval system cannot accurately understand the user's query intention. In this ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/334G06F16/335G06F16/36G06F16/9535
Inventor 罗准辰王莹于洋罗威韦博陈钧
Owner 中国国防科技信息中心
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products