A non-word-segmented burst topic detection method for microblog
A topic detection, non-word segmentation technology, applied in unstructured text data retrieval, text database clustering/classification, special data processing applications, etc., can solve problems such as difficulty in detecting emergent topics, and achieve the effect of improving overall performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0041] The existing burst topic detection methods based on Chinese word segmentation are all based on word frequency information of feature words. For Chinese microblogging, it is first necessary to perform Chinese word segmentation, construct the feature trajectory of the feature words, calculate the burst feature words according to a certain burst detection algorithm, and then use the set of highly relevant feature words to represent the burst topics.
[0042] For Chinese microblogging, this approach has certain flaws. Due to the diversity of Weibo users, Weibo terminology is flexible and non-standard, such as diaosi, Bogu Kailai, China on the tip of the tongue, Tangshan earthquake and other words or strings. There are a large number of sudden topics induced by new words or strings in Weibo, but these new words or meaningful strings cannot be divided according to the Chinese word segmentation dictionary, so that it is impossible to accurately find sudden topics in Weibo.
...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com