Social media hot topic extraction method and system

A social media hot topic extraction method and system, applied in database retrieval, network data retrieval, instruments, etc. It addresses the problem that rapid discovery of institution-specific events is difficult to provide as a uniform service, and achieves improved readability, reduced input of manually labeled data, and low implementation cost.

Pending Publication Date: 2020-04-10
FUJIAN YIRONG INFORMATION TECH +4
AI Technical Summary

Problems solved by technology

Data collection and processing capabilities can be provided by service providers (such as Sina), whereas the rapid discovery of specific events related to government departments, enterprises, and other institutions is highly personalized and difficult to provide uniformly. Current technical research hotspots

Examples

Embodiment 1

[0047] A social media hot topic extraction method (see Figure 2) includes the following steps:

[0048] S10. Filter and summarize the collected Internet text information data;

[0049] S20. Use a text clustering algorithm to group the filtered Internet text information data that shares the same topic;

[0050] S30. After the Internet text information data is clustered, generate a text summary for the topic selected by the user, thereby completing hotspot extraction and displaying the corresponding topic.
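
The three steps above can be sketched end to end as follows. This is only an illustration under stated assumptions: the source does not name a specific clustering or summarization algorithm at this point, so the sketch swaps in exact-duplicate filtering, single-pass clustering on Jaccard similarity, and a frequency-based extractive summary. All function names and the 0.3 threshold are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase word tokens; a stand-in for a real tokenizer (assumption)."""
    return re.findall(r"\w+", text.lower())


def filter_texts(texts):
    """S10 (sketch): drop empty items and exact duplicates after normalization."""
    seen, kept = set(), []
    for t in texts:
        key = " ".join(t.split()).lower()
        if key and key not in seen:
            seen.add(key)
            kept.append(t)
    return kept


def jaccard(a, b):
    """Jaccard similarity of two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_texts(texts, threshold=0.3):
    """S20 (sketch): single-pass clustering against each cluster's first text."""
    clusters = []  # list of (seed_token_set, member_texts)
    for t in texts:
        toks = set(tokenize(t))
        for seed, members in clusters:
            if jaccard(toks, seed) >= threshold:
                members.append(t)
                break
        else:
            clusters.append((toks, [t]))
    return [members for _, members in clusters]


def summarize(cluster):
    """S30 (sketch): pick the text whose tokens are most common in the cluster."""
    freq = Counter(tok for t in cluster for tok in set(tokenize(t)))

    def score(t):
        toks = set(tokenize(t))
        return sum(freq[tok] for tok in toks) / len(toks) if toks else 0.0

    return max(cluster, key=score)


def extract_hot_topics(texts):
    """Run S10 -> S20 -> S30 and return (cluster size, summary), largest first."""
    clusters = cluster_texts(filter_texts(texts))
    clusters.sort(key=len, reverse=True)
    return [(len(c), summarize(c)) for c in clusters]
```

Calling `extract_hot_topics` on a list of raw texts returns one entry per topic, ordered by cluster size, so the largest clusters can be shown to the user as candidate hot topics.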

[0051] In the social media hot topic extraction method, the collected Internet text information data is filtered and summarized. When collecting a corpus, it is necessary to gather material that is as comprehensive and varied as possible, yet the same article is often reposted on multiple platforms. Take a corpus of 15,000 news items about Trump as an example: items with distinct content account for only 8.5% of the total corpus, and within this 8.5%,...

Embodiment 2

[0102] A social media hot topic extraction system (see Figure 1) includes a data screening module, a topic clustering module, and a topic summary extraction module;

[0103] The data screening module filters and summarizes the collected Internet text information data;

[0104] The topic clustering module uses a text clustering algorithm to group the filtered Internet text information data that shares the same topic;

[0105] The topic summary extraction module generates a text summary from the corresponding clustered Internet text information data according to the topic selected by the user, thereby completing the hotspot extraction.
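
One way to picture how the three modules fit together is as pluggable components behind a small wrapper. The sketch below is a hypothetical arrangement, not the structure disclosed in the patent: the class name `HotTopicSystem`, its method names, and the three trivial stand-in functions are all assumptions chosen only to show the data flow from screening to clustering to a summary for a user-selected topic.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class HotTopicSystem:
    """Hypothetical wiring of the three modules named in the text."""
    screen: Callable[[List[str]], List[str]]              # data screening module
    cluster: Callable[[List[str]], Dict[str, List[str]]]  # topic clustering module
    summarize: Callable[[List[str]], str]                 # topic summary extraction module

    def build_topics(self, raw_texts: List[str]) -> Dict[str, List[str]]:
        """Screen the raw texts, then group them by topic."""
        return self.cluster(self.screen(raw_texts))

    def summary_for(self, topics: Dict[str, List[str]], selected: str) -> str:
        """Summarize only the topic the user selected."""
        return self.summarize(topics[selected])


# Placeholder module implementations, for illustration only.
def drop_exact_duplicates(texts: List[str]) -> List[str]:
    stripped = (t.strip() for t in texts)
    return list(dict.fromkeys(t for t in stripped if t))


def by_first_word(texts: List[str]) -> Dict[str, List[str]]:
    groups: Dict[str, List[str]] = {}
    for t in texts:
        groups.setdefault(t.split()[0].lower(), []).append(t)
    return groups


def longest_text(texts: List[str]) -> str:
    return max(texts, key=len)


system = HotTopicSystem(drop_exact_duplicates, by_first_word, longest_text)
```

Keeping each module behind a plain callable means a stronger screening, clustering, or summarization method can replace a placeholder without changing the wrapper.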

[0106] In the social media hot topic extraction system described above, the data screening module specifically performs the following steps:

[0107] S11. Calculate the sensitive hash fingerprint of the acquired Internet text information data;

[0108] S12. Using sensitive hash fingerprints to filter repeat...
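
The fingerprint-based screening in S11 and S12 could look roughly like the sketch below. It assumes that the "sensitive hash fingerprint" is a locality-sensitive hash in the style of SimHash, which the visible text does not confirm. The 64-bit width, the MD5-based token hash, the word tokenization, and the Hamming distance threshold of 3 are all illustrative choices, and the function names are hypothetical.

```python
import hashlib
import re

BITS = 64  # fingerprint width (assumption)


def simhash(text):
    """S11 (sketch): SimHash-style fingerprint built from word tokens."""
    weights = [0] * BITS
    for token in re.findall(r"\w+", text.lower()):
        digest = hashlib.md5(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")  # first 64 bits of the token hash
        for i in range(BITS):
            weights[i] += 1 if (h >> i) & 1 else -1
    fingerprint = 0
    for i, w in enumerate(weights):
        if w > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming(a, b):
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


def filter_near_duplicates(texts, max_distance=3):
    """S12 (sketch): keep a text only if no kept fingerprint is within max_distance."""
    kept, fingerprints = [], []
    for t in texts:
        fp = simhash(t)
        if all(hamming(fp, seen) > max_distance for seen in fingerprints):
            kept.append(t)
            fingerprints.append(fp)
    return kept
```

The linear scan over kept fingerprints is quadratic in the worst case; for a large corpus an index over fingerprint segments would be the usual refinement, which this sketch leaves out.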

Abstract

The invention discloses a social media hot topic extraction method and system. The hot topic extraction method comprises the following steps: filtering and summarizing collected Internet text information data, then using a text clustering algorithm to group the Internet text information data of the same topic, and, after clustering, generating a text abstract for the selected topic. The hot topic extraction system comprises a data screening calculation module, a topic clustering module, and a topic abstract extraction module. The data screening calculation module filters and summarizes the collected Internet text information data; the topic clustering module uses the text clustering algorithm to cluster the filtered Internet text information data of the same topic; and the topic abstract extraction module generates a text abstract from the corresponding clustered Internet text information data according to a topic selected by a user, so as to complete hotspot extraction. The method and the system improve the extraction quality of social media hot topics and have relatively high practical value.

Description

Technical field

[0001] The invention belongs to the technical field of data analysis and relates to a method and system suitable for extracting hot topics from various social media data.

Background technique

[0002] With the continuous and in-depth development of information technology and the Internet, social media, including Weibo and WeChat public accounts, have become a main channel with growing influence and timeliness. However, the vigorous development of social media has also led to continuous growth in the scale of the related data. Taking Sina Weibo as an example, it had 462 million monthly active users in 2018, having grown by more than 70 million for three consecutive years, and its number of vertical fields expanded to 60, of which 32 had a monthly reading volume of over 10 billion. How to obtain social media information related to brand and development in a timely manner from massive and rapidly changing social media has become an important topic ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/951, G06F16/35, G06F16/34, G06F16/335
CPC: G06F16/951, G06F16/35, G06F16/345, G06F16/335
Inventor: 宋立华, 王秋琳, 梁懿, 庄莉, 陈睿欣, 于灏
Owner: FUJIAN YIRONG INFORMATION TECH