Association rule algorithm based on Apriori improved algorithm

A technology for improving algorithms and rules, applied in computing, instrumentation, electrical and digital data processing, etc., can solve problems such as large I/O loads, and achieve the effects of improving work efficiency, improving operating efficiency, and shortening time

Inactive Publication Date: 2016-06-29
NANJING UNIV OF SCI & TECH
View PDF2 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, with the increase of research data, the requirements for algorithm performance are getting higher and higher, and the defects of Apriori algorithm are gradually exposed.
The traditional Apriori algorithm has two bottlenecks. One is that multiple scans of the database generate a huge I / O load, and the other generates a huge candidate set, which is a great challenge to the time and space occupied by the algorithm.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Association rule algorithm based on Apriori improved algorithm
  • Association rule algorithm based on Apriori improved algorithm
  • Association rule algorithm based on Apriori improved algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0012] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0013] 1. Bottleneck analysis of Apriori algorithm

[0014] The classic algorithm of association rules Apriori generates frequent itemsets by iteratively searching the database, which plays a great role in exploring the inner relationship of transactions, and is an important branch of data mining research. However, with the increase of research data, the requirements for algorithm performance are getting higher and higher, and the defects of Apriori algorithm are gradually exposed.

[0015] The Apriori algorithm has the following two bottlenecks that affect performance:

[0016] (1) Scan the database multiple times, resulting in huge I / O load

[0017] First, in the k-th cycle, in the candidate k-itemset generated by the connection, the (k-1) item subset of each item needs to scan the previous frequent (k-1) item set to determine whether the item is is a fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an association rule algorithm based on an Apriori improved algorithm. First of all, a transaction database D is preprocessed, after data records are simplified, the data records are all read into a memory, in the process when candidate sets are generated through connecting and cutting frequent item sets, the process when the candidate sets are generated is improved, a candidate item set is directly generated, the database is scanned for calculating support after the candidate sets are obtained, and since the candidate sets and the transaction database D are ordered, when the candidate sets are respectively searched for in each transaction T, i.e., each record, once values greater than a candidate item are sought, search of the transaction is stopped. According to the invention, the improved Apriori algorithm is applied to a pharmacy management system, results indicate that the performance of the improved algorithm is obviously better than a conventional algorithm, the operation is concise, and actual demands are better satisfied.

Description

technical field [0001] The present invention relates to the fields of data mining and algorithm analysis, in particular to data mining technology. Background technique [0002] In my country, with the development and popularization of information technology, data warehouse technology has also entered a period of rapid development, but there is still a big gap compared with developed countries. The competition in the domestic retail market is fierce, especially the competition in the commodity market is even more severe. Although some enterprises in our country have gradually entered into information warehouse management, at present, there is no mature data warehouse technology in China, resulting in the emergence of "massive data, In the case of "lack of information", it is impossible to use a large amount of information data reasonably and effectively, which is a great loss to the enterprise. More importantly, the domestic data warehouse application has not applied it with...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q30/02G06F17/30
Inventor 高曾荣何新应国力王建宇周宇浩袁侃辉
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products