Unlock instant, AI-driven research and patent intelligence for your innovation.
Local scan association rule computer data analysis method based on pre-judging screening
What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A data analysis, computer technology, applied in computing, data mining, electrical digital data processing, etc.
Active Publication Date: 2017-02-15
CHINA INFOMRAITON CONSULTING & DESIGNING INST CO LTD
View PDF2 Cites 1 Cited by
Summary
Abstract
Description
Claims
Application Information
AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology
Problems solved by technology
This algorithm has good scalability and efficiency when processing massive data sets, but the calculation requires strong computing and storage capacity support, usually running in a cluster environment
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more
Image
Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
Click on the blue label to locate the original text in one second.
Reading with bidirectional positioning of images and text.
Smart Image
Examples
Experimental program
Comparison scheme
Effect test
Embodiment 1
[0062] Such as Figure 10 Shown, the present invention comprises the steps:
[0063] Step 1, scan the transaction database D to get the set L of frequent k-1 itemsets k-1 ;
[0064] Step 2, the set L of frequent k-1 itemsets k-1 Connect with itself to generate a set of candidate k-itemsets, and the set of candidate frequent k-itemsets is denoted as C K ;
[0065] Step 3, using the Apriori property (all non-empty subsets of any frequent itemset must also be frequent, if a candidate non-empty subset is not frequent, then the candidate must not be frequent) to set C K pruning;
[0066] Step 4, calculate the set C K Pre-judgment support of members in the group, and pre-judgment screening;
[0067] Step 5, perform partial scan judgment on the transactional database D;
[0068] Step 6: Repeat steps 2 to 5 above until no larger frequent itemsets can be found;
[0069] Step 7, the final frequent item set set is recorded as F, then the association rule R={X->Y, X, Y is the fre...
Embodiment 2
[0088] through the pair such as Figure 9 The supermarket sales data set shown (the data set contains 10,000 sales records, that is, 10,000 things, 112 kinds of commodities, that is, 112 items) uses the MAWP algorithm to analyze the association rules, and the performance of the MAWP algorithm is verified. Minimum support min_support=0.05.
[0089] In this example, the frequent itemsets obtained by running the MAWP algorithm and the AWP algorithm and the Apriori algorithm are exactly the same, but the Apriori algorithm needs to scan the data set 967 times, while the MAWP algorithm and the AWP algorithm only need to scan the data set 682 times, compared with the Apriori algorithm. Reduced by 29.47%; the number of transactions scanned by the Apriori algorithm is 9.67×10 6 , the number of transactions scanned by the AWP algorithm is 6.82×10 6 , the number of transactions scanned by the MAWP algorithm is 4.6992×10 5 Compared with Apriori algorithm and AWP algorithm, MAWP reduces...
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More
PUM
Login to View More
Abstract
The invention discloses a local scan association rule computer data analysis method based on pre-judging screening. Aiming at inherent defects of a classic Apriori algorithm, a local scan association rule analysis algorithm-MAWP algorithm based on transaction number query is provided based on an association rule analysis algorithm based on pre-judging screening. The algorithm records transaction numbers including a frequency k item set, and partially scans the transactions in the database but not completely scans the same during a process of verifying the screened candidate k item set, namely only scans a transaction set including the k-1 item set and with the minimal amount of the transactions, so that the total amount of the transactions scanned for determining the frequency item set is decreased, the operation time of the algorithm is reduced, and the operation efficiency of the algorithm is improved.
Description
technical field [0001] The invention belongs to the technical field of computer data mining and information processing, and in particular relates to a computer data analysis method for local scan association rules based on pre-judgment screening. Background technique [0002] Today, with the rapid development of big data technology, people gradually realize that data is wealth, especially the analysis of business data has great practical value. As one of the main methods of data mining, association rule analysis is an indispensable and important part of data mining technology. It is mainly used to discover valuable and interesting connections and rules hidden in large transaction databases. Therefore, the research on association rule algorithms is of great significance. [0003] As early as 1993, IBM's computer scientist R. Agrawal and others discovered the purchase rules of customers when purchasing goods in the customer transaction database, and proposed the correlation m...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More
Application Information
Patent Timeline
Application Date:The date an application was filed.
Publication Date:The date a patent or application was officially published.
First Publication Date:The earliest publication date of a patent with the same application number.
Issue Date:Publication date of the patent grant document.
PCT Entry Date:The Entry date of PCT National Phase.
Estimated Expiry Date:The statutory expiry date of a patent right according to the Patent Law, and it is the longest term of protection that the patent right can achieve without the termination of the patent right due to other reasons(Term extension factor has been taken into account ).
Invalid Date:Actual expiry date is based on effective date or publication date of legal transaction data of invalid patent.
Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/2465G06F2216/03
Inventor 赵学健袁源孙知信乔爱锋陈思光王鹏
Owner CHINA INFOMRAITON CONSULTING & DESIGNING INST CO LTD