Big data association rule mining method based on Spark
A technology of big data and rules, applied in the fields of electrical digital data processing, special data processing applications, digital data information retrieval, etc., to achieve the effect of fast and efficient mining, less memory and IO usage
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0026] Below in conjunction with accompanying drawing, technical scheme of the present invention is described further:
[0027] figure 1 Shown is a flow chart of the method of the present invention, and each step is described in detail according to the content shown in the flow chart.
[0028] First read the converted vertical database file in the local or HDFS, get the RDD after reading the file, call the filter() in the conversion operator to filter the RDD, remove the data whose support degree is less than the minimum support degree, and then get frequent 1- itemsets. According to the obtained frequent 1-itemsets, the intersection sets are continuously obtained to obtain frequent 2-itemsets. For the acquisition of frequent K(K>2)-itemsets, the frequent 2-itemsets should be prefixed first, and then the frequent 2-itemsets and prefixes should be added to the divided data to obtain frequent 3-itemsets. By analogy, to find frequent K-itemsets, it is necessary to extra...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


