Hierarchical clustering method based on Hadoop and HBase
A hierarchical clustering algorithm technology, applied in special data processing applications, instruments, electrical digital data processing, etc., to achieve the effect of improving scalability and big data processing capabilities.
Embodiment Construction
[0020] In order to make the technical solutions and advantages of the present invention clearer, further detailed description will be given below in conjunction with the accompanying drawings, but the implementation and protection of the present invention are not limited thereto. It should be pointed out that, if there are symbols or processes in the following that are not specifically described in detail, those skilled in the art can understand or implement them with reference to the prior art.
[0021] 1. Parallel calculation algorithm of distance matrix
[0022] The parallel calculation algorithm of the distance matrix aims to speed up the calculation of the distance matrix and import it quickly into HBase. During clustering, the hierarchical clustering algorithm relies on a distance matrix whose space complexity is O(n^2). In this method, a Hadoop-based parallel computing algorithm is designed and implemented for the calculation of the distance matrix, s...
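The passage above is cut off before the partitioning scheme is given, so the following is only a minimal sketch of one plausible way to parallelize the distance matrix, not the patented method. It assumes Euclidean distance, one map-style task per row index i that computes distances to every later point j > i (the upper triangle, n(n-1)/2 entries), and an in-memory dictionary standing in for an HBase table. The names map_row and build_distance_table, the zero-padded row keys, and the "d:" column prefix are all hypothetical, and a thread pool stands in for Hadoop map tasks.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def euclidean(a, b):
    """Euclidean distance between two equal-length points (an assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def map_row(i, points):
    """Stand-in for one map task: distances from point i to every point j > i.

    Returns a (row_key, cells) pair shaped like an HBase row, where each
    cell is keyed by a hypothetical 'd:<column index>' qualifier.
    """
    row_key = f"{i:08d}"
    cells = {
        f"d:{j:08d}": euclidean(points[i], points[j])
        for j in range(i + 1, len(points))
    }
    return row_key, cells


def build_distance_table(points, workers=4):
    """Run one map_row task per row in parallel and collect the results.

    The returned dict mimics a sparse table holding only the upper
    triangle of the symmetric distance matrix.
    """
    table = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda i: map_row(i, points), range(len(points)))
        for row_key, cells in results:
            if cells:  # the last row has no j > i, so it is skipped
                table[row_key] = cells
    return table


points = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
table = build_distance_table(points)
```

Storing only the upper triangle halves the storage relative to the full n x n matrix. In a real deployment the per-row results would be written to HBase rather than collected in a dictionary, and that write path is not shown here.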