A method for generating tweet summaries based on topic relevance

A correlation and topic technology, applied in the field of text summarization in natural language processing, can solve the problem of not introducing specific topics and social network data, and achieve the effect of rich information, good novelty, and reduced redundant information
CN112883716BActive Publication Date: 2022-05-03CHONGQING UNIV OF POSTS & TELECOMM

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
CHONGQING UNIV OF POSTS & TELECOMM
Publication Date
2022-05-03

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses a method for generating tweet summaries based on topic correlation, which includes establishing a thesaurus for each topic through the distribution of nouns in each topic; and calculating The correlation between a tweet and a topic; calculate the public recognition degree according to the network interaction information; combine the public recognition degree and the topic correlation to get the final tweet significance; use the maximum marginal correlation algorithm for deduplication processing, Output summary. This method selects tweets as summaries from topic relevance and tweet salience, and controls the redundancy of the final summaries, so that the generated tweet summaries comprehensively consider the summarization topics, diversity, and social identity. As a result, abstracts with higher topic relevance, novelty and summarization are obtained.
Need to check novelty before this filing date? Find Prior Art

Description

Technical field

[0001] The technical field involves text summary technology in natural language processing, which is used to automatically generate a summary of the topic of Twitter speech. Specifically, given a particular topic and several tweet texts, a summary related to that topic is obtained. Background

[0002] With the rapid development of social network media and self-media, abstract research on summarizing massive data has been spawned. Since there is no large-scale public dataset of social network data, most of the current summary studies of social network data are traditional unsupervised methods. Methods based on statistical features, mainly based on the relative position of sentences, word frequency characteristics, etc. to study, such methods are easy to implement, but the obtained features are often relatively simple; based on the method of graph model, such methods regard the sentences in the text as nodes, the similarity score between the texts as the edges betw...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More