Hot topic detection method based on RoBERTa-WWM and HDBSCAN algorithms
A hot topic detection method, applied in computing, unstructured text data retrieval, semantic analysis, and related fields. It addresses the problem of poorly distinguishable text vectors, improves detection accuracy, and mitigates the effects of topic drift and evolution on detection results.
Examples
Embodiment 1
[0043] As shown in Figure 1, the present invention discloses a hot topic detection method based on the RoBERTa-WWM and HDBSCAN algorithms. The method includes offline hot topic detection and online hot topic detection.
[0044] Offline hot topic detection detects the hot topics contained in data that already exists in the database. During processing, the data is fixed and no new topics are generated.
[0045] Online hot topic detection detects hot topics that arise on Internet media platforms within a given time interval. During this process the data is constantly updated, so the method must consider the similarity between newly arrived reports and existing topics, as well as the impact of topic drift and evolution on the detection results. The computational efficiency of the algorithm must also be considered, to ensure that results are available in real time.
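The online case described in [0045] can be illustrated with a minimal sketch. It is not the patented procedure: the class names `Topic` and `OnlineTopicTracker`, the cosine similarity measure, the threshold value of 0.8, and the running-mean centroid update are all assumptions chosen for illustration, and the report vectors are assumed to have been produced beforehand by a text encoder such as RoBERTa-WWM.

```python
import math
from dataclasses import dataclass, field


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class Topic:
    """Hypothetical topic record: a centroid vector and a report count."""
    centroid: list
    size: int = 1


@dataclass
class OnlineTopicTracker:
    # Assumed threshold; the source does not state a value.
    threshold: float = 0.8
    topics: list = field(default_factory=list)

    def add_report(self, vector):
        """Assign a report vector to its most similar topic, or open a new
        topic when no existing topic is similar enough. Returns the topic index."""
        best_index, best_score = -1, -1.0
        for i, topic in enumerate(self.topics):
            score = cosine_similarity(vector, topic.centroid)
            if score > best_score:
                best_index, best_score = i, score

        if best_index >= 0 and best_score >= self.threshold:
            topic = self.topics[best_index]
            n = topic.size
            # Running mean: the centroid follows gradual topic drift
            # (an assumed update rule, not taken from the source).
            topic.centroid = [
                (c * n + v) / (n + 1) for c, v in zip(topic.centroid, vector)
            ]
            topic.size = n + 1
            return best_index

        # No sufficiently similar topic: treat the report as a new topic.
        self.topics.append(Topic(centroid=list(vector)))
        return len(self.topics) - 1


tracker = OnlineTopicTracker(threshold=0.8)
first = tracker.add_report([1.0, 0.0])   # opens the first topic
second = tracker.add_report([0.9, 0.1])  # similar to the first topic, so it is merged
third = tracker.add_report([0.0, 1.0])   # dissimilar, so a new topic is opened
```

Each new report costs one similarity computation per existing topic, which keeps the per-report work small; a production system would likely also expire stale topics, which this sketch omits.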
[0046] Preferably, the offline hot topic det...