Mass data frequent item set mining method
A technology of frequent itemset mining and frequent itemsets, applied in data mining, digital data processing, special data processing applications, etc., can solve problems such as incorrect data structure construction and complex data structure
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0083] figure 1 is a schematic flowchart of a method in one embodiment of the present invention. in, figure 1 The execution subject can be a computing node, a server, or an ordinary PC. The massive data frequent itemset mining method is used to mine the frequent itemsets satisfying the global minimum support degree minsup in the total transaction data set T, and the global minimum support degree minsup is the minimum support on the preset total transaction data set T degree, the total transaction data set T includes the original transaction data set T O and the new transaction data set T Δ .
[0084] see figure 1 , the massive data frequent itemset mining method includes:
[0085] Step 110, use the frequent itemset mining algorithm to analyze the original transaction data set T O Mining to obtain the original transaction data set T O corresponding to all local frequent itemsets.
[0086] Specifically, the original transaction data set T can be read sequentially O and...
Embodiment 2
[0141] Compared with Embodiment 1, the difference is that the massive data frequent itemset mining method described in Embodiment 2, in order to further improve the mining rate of the present invention, between the above-mentioned step P11 and step P12, also includes the step S: for the set LF k,Δ The fine cutting step;
[0142] where, for the set LF k,Δ The refinement steps include:
[0143] According to whether the first (k-1) items of the item set are the same, obtain and divide the set LF k,Δ The grouping of frequent k-itemsets in , obtains a corresponding number of itemset groups, wherein, the first (k-1) items of the frequent k-itemsets in the same itemset group are the same;
[0144] Count the number of frequent k-itemsets in each itemset group separately, and judge whether the counted numbers are equal to 1: if yes, delete the corresponding itemset group, and delete the set LF k,Δ Among the itemsets identical to the frequent k-itemsets in the corresponding itemset ...
Embodiment 3
[0158] Compared with Embodiment 2, the difference is that, in the method for mining frequent itemsets of massive data described in Embodiment 3, when updating the newly added transaction data set T Δ , and upon updating the original transaction dataset T O is the original original transaction data set T O and the original new transaction data set T Δ and the updated new transaction data set T Δ When non-empty, also includes the step of updating mining.
[0159] Specifically, the step of update mining described in this embodiment includes:
[0160] Update the number n of transactions in the total transaction data set T to the original original transaction data set T O and the original new transaction data set T Δ The sum of the number of transactions;
[0161] Obtain the original original transaction data set T obtained above O For all the corresponding local frequent itemsets, calculate the obtained original original transaction data set T O The corresponding local fre...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com

