Association rule mining method based on mass data

A technology of data association and rules, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of long processing time, many data records, low efficiency, etc., and achieve the effect of reducing the number of iterations and improving processing efficiency.

Inactive Publication Date: 2013-08-21
CHONGQING UNIV OF POSTS & TELECOMM
View PDF2 Cites 52 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when mining data association rules for massive data records, on the one hand, due to the limited memory capacity of a s

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Association rule mining method based on mass data
  • Association rule mining method based on mass data
  • Association rule mining method based on mass data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The method for implementing data association rule mining provided by the embodiment of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0046] figure 1 A schematic structural diagram of frequent pattern tree pruning provided by an embodiment of the present invention, including:

[0047] Perform frequent 1-itemset statistics on the source data, obtain the conditional pattern base, construct the frequent pattern tree, and use an optimization method in the process of mining the frequent pattern tree. For specific steps, see figure 2 , including the following steps:

[0048] Divide the input data and distribute it to multiple computing nodes. Each computing node counts the frequency of 1-itemsets in parallel, sorts them according to the frequency of 1-itemsets, and deletes 1-itemsets smaller than a given threshold to construct frequent 1-itemsets. -itemset table; according to the frequent 1-itemset table to group fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an association rule mining method based on mass data and relates to the technical field of information processing. The association rule mining method specifically comprises the steps that input data are divided according to records, a frequent 1-itemset table is set, data projection is carried out on the frequent 1-itemset table, the projected data are stored in a set grouped table, a local frequent pattern tree is set, then data association rule mining is carried out on the frequent pattern tree, in the process of mining on the frequent pattern tree, a pruning strategy is adopted, the number of iterations of the association rule mining is reduced, each computation node only processes one part of data records, and therefore the problem that the mass data can not be read into a memory by one computer to be processed is solved. In addition, various nodes participate processing in parallel, and processing efficiency is improved effectively.

Description

technical field [0001] The invention relates to the field of data mining, in particular to a method for realizing data association rule mining. Background technique [0002] With the rapid development of information technology, it has become a very difficult task to discover valuable information and knowledge from increasingly large and complex data to serve the purpose of decision-making. Data mining technology came into being in this context. Association rule mining is an important branch of data mining, and it is also the most widely used type of data mining. [0003] The purpose of mining data association rules is to discover the noteworthy associations or correlations among a large number of data items, and the typical application is shopping basket analysis in the retail industry. The so-called shopping basket analysis refers to the study of the association rules of the data, and discovers the connection between different commodities in the transaction database, whic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 杨勇王伟
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products