Multi-source heterogeneous online network topic early identification method
A multi-source heterogeneous early topic identification technology, applied in network data retrieval, network data indexing, and other database retrieval. It addresses the difficulty of discovering topics early, which arises from the complexity of information production, dissemination, and interaction in online networks.
Examples
Embodiment 1
[0035] Referring to Figure 2, the specific operation process of this embodiment is as follows:
[0036] 1) Analyze the structural characteristics of the different online social networks and design a distributed parallel crawler engine suited to those characteristics. Use the distributed parallel crawler engine to crawl the original short text information published on the online social networks, then preprocess that text by Chinese word segmentation and text feature value extraction to obtain a short-text keyword set D0;
[0037] Here, the crawled original short text information published on online social networks includes the news headlines of each news site and the microblogs of each microblog platform, and the TF-IDF method is used to extract the original short text information through Chinese word segmentation and text feature v...
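The TF-IDF keyword extraction named in [0037] can be illustrated with a minimal sketch. It is not the patented implementation: the function names `tfidf_keywords` and `keyword_set`, the `top_k` cutoff, and the particular TF-IDF variant (relative term frequency multiplied by log(N / document frequency)) are assumptions made for illustration. The input is assumed to be already segmented into tokens by a Chinese word segmenter, and English placeholder tokens stand in for real headlines and microblogs.

```python
import math
from collections import Counter


def tfidf_keywords(docs, top_k=3):
    """Return the top_k highest-scoring TF-IDF terms for each document.

    docs is a list of token lists (hypothetical input: short texts that
    have already been through word segmentation).
    """
    n_docs = len(docs)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    results = []
    for tokens in docs:
        if not tokens:
            results.append([])
            continue
        tf = Counter(tokens)
        # Assumed variant: relative term frequency * log(N / df).
        scores = {
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        }
        # Highest score first; ties broken alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        results.append(ranked[:top_k])
    return results


def keyword_set(docs, top_k=3):
    """Union of per-document keywords, playing the role of the set D0."""
    merged = set()
    for keywords in tfidf_keywords(docs, top_k):
        merged.update(keywords)
    return merged


# Placeholder short texts from two hypothetical source types.
sample_docs = [
    ["earthquake", "rescue", "city"],    # news headline
    ["earthquake", "market", "city"],    # microblog
    ["city", "festival"],                # microblog
]
per_doc = tfidf_keywords(sample_docs, top_k=1)
d0 = keyword_set(sample_docs, top_k=1)
```

In this sample, "city" appears in every text, so its inverse document frequency is log(1) = 0 and it is never selected as the top keyword. Terms confined to a single text score highest.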

