Real-time text clustering method based on Jaccard distance
A text clustering and distance technology, which is applied in the fields of natural language processing and big data, can solve problems such as slow processing speed and low accuracy of real-time clustering, and achieve the effects of improving operational efficiency, user experience, and improving results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] The following examples are presented to illustrate certain embodiments of the invention and should not be construed as limiting the scope of the invention. The content disclosed in the present invention can be improved simultaneously from materials, methods and reaction conditions, and all these improvements should fall within the spirit and scope of the present invention.
[0022] Such as figure 1 As shown, a real-time text clustering method based on Jaccard distance, which specifically includes the following steps:
[0023] S1: Text similarity calculation: select text a and text b from the data to be clustered (news data, WeChat official account data, Weibo data, and post bar data), and calculate the Jaccard distance of text a and text b; Extract keywords Sa and Sb from text a and text b, the number of keywords is 35, and then calculate the intersection |A|=Sa∪Sb between the corresponding keywords of the two texts, and the union |B|=Sa∪Sb, where Jaccard distance (...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com

