A spark-based apriori parallelization method, system and device
A frequent collection and transaction technology, applied in data mining, instrumentation, computing, etc., can solve the problems of slow frequent collection, inflexibility, and increased network overhead, so as to improve generation speed and efficiency, overcome large network overhead, and overcome Generating Slow Effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0096]The existing APRIORI is in parallel and operates locally. Since the stand-alone resources are limited, the speed of the candidate set is slow and the efficiency is low; in addition, the existing pruning operation only broadcasts the transaction database, leading to face When the big transaction database, the overhead of the network increased significantly, and the speed of the generation frequency also decreased significantly. The present invention proposes a SPARK-based APRIORI parallelization method, system and device, overcomes the above disadvantages of the prior art, improves operational speed and efficiency, and also reduces network overhead.
[0097]The following is a detailed description of the SPARK-based Apriori parallelization method of the present invention from the noun interpretation and specific implementation.
[0098](1) explanation of noun
[0099]The proprietary noun relating to the present invention is as follows:
[0100]Spark Calculation Frame: Spark is a framework f...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


