Distributed clustering method facing to internet micro-content
A distributed clustering and Internet technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of high maintenance cost, not, not ideal, etc., to achieve wide application range, simple operation and low maintenance cost small effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0033] In the clustering application system oriented to Internet micro-content, the distributed clustering method provided by the present invention can be used to quickly and accurately cluster massive micro-content. Taking the blog comment spam clustering system as an example, the specific The implementation steps are as follows:
[0034] 1) The main control machine first performs segmentation operation on the blog comment source file to obtain multiple small source data files. The specific process is as follows:
[0035] For the input large blog comment source file, write it into multiple small files according to the fixed number of records in each file, and write one blog comment in each small file. The fixed number of comments is determined by the specific implementation of meta-clustering The configuration of the clustering machine is determined by the operation. Figure 2 shows the structural diagram of the segmentation module, where Split_1, Split_k, and Split_n in Figur...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com