Abstract method based on social media microblog specific topic

A microblogging, social technology, applied in the fields of natural language processing and social media text mining, can solve the problems of integrating into a unified optimization model, rarely, and not using the microblogging network structure.

Inactive Publication Date: 2018-05-04
TIANJIN UNIV
View PDF2 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The above methods mainly examine the importance, and more external methods are used for diversity, such as Maximal Marginal Relevance (MMR), and there are few optimizations that integrate coverage, importance, and diversity into a unified methods in the model; in addition, these methods do not exploit the underlying microblog network structure in social media, which may contain more semantic cues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Abstract method based on social media microblog specific topic
  • Abstract method based on social media microblog specific topic
  • Abstract method based on social media microblog specific topic

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0092] The technical solutions of the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0093] Taking topic 11 and topic 9 in the 12 Twitter real data sets constructed as examples, the implementation method of the invention is given. The process of the whole system includes four steps: microblog topic selection, microblog summarization framework integrating sparse reconstruction and microblog potential network structure and diversity method, fast microblog summarization algorithm based on Nesterov accelerated gradient descent, and summary generation.

[0094] 1) Weibo topic screening

[0095] Preliminary screening of topics: For each topic, according to the tags #a#, #b#, and the keywords a and b obtained after removing the "#", retrieve the microblogs containing the above information from a real Twitter data set, and based on the daily Draw a sequence diagram of the number of microblogs for this topi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an abstract method based on a social media microblog specific topic. The method comprises the following steps of (1) obtaining W according to a following formula in the description, wherein the formula merges a microblog abstract optimization model of group sparsity study and social regular term parameters; the S is a text matrix; a data set is obtained through the calculation from TF-IDF; the W is a refactoring coefficient matrix; Lambda is a group sparsity regular term parameter; the L is D-T and is an Laplace matrix; (2) obtaining the importance Score (i) of the i-thmicroblog by calculating the normal formulas of the i-th line of the W shown in the description; sequencing the microblog according to the importance; further screening the front k microblogs to be used as the abstract, wherein the Score (i) is shown in the description. The abstract method is based on the basic framework of the sparse reconstruction and merges the social media content and a socialnetwork structure; the obtained microblog abstract is more similar to an expert mutual evaluation result in aspects of three evaluation indexes of ROUGE-1, ROUGE-2 and ROUGE-SU4 through being compared with the existing model.

Description

technical field [0001] The invention relates to the fields of natural language processing and social media text mining, in particular to a method for summarizing specific topics based on social media microblogs. Background technique [0002] With the rapid development of social media platforms, such as Weibo, Twitter, etc., their fast and convenient features make people gradually rely on these platforms for obtaining information. At the same time, due to the large number of microblog users, when an event occurs, a large number of related microblogs will emerge in a short period of time to describe various aspects of the event topic, which fully reflects the large-scale, real-time, and fragmented nature of microblogs. and the weak normativeness of short texts. [0003] The development of social media has created greater redundancy due to the frequent interaction and mutual influence of people. Massive microblogs can easily overwhelm people in information, making it difficult...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/345G06F16/35
Inventor 贺瑞芳段兴义张雪菲李三飞
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products