Maximum frequent item set mining method based on adjacency list

A technology of maximum frequent itemsets and frequent itemsets, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as low space and time performance, algorithms cannot work effectively, and occupy large memory, etc. Achieve the effect of low time complexity, reduced I/O operations, and efficient performance

Inactive Publication Date: 2019-05-21
NORTHWESTERN POLYTECHNICAL UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But it needs to create an FP-Tree containing all dataset items, which takes a lot of memory, and the memory consumption is proportional to the FP-Tree width and depth
The depth is generally the maximum number of items in a single transaction. If the number of frequent 1-itemsets in the database is large, and the memory cannot load the mapping information of all items in the FP-Tree, the algorithm will not work effectively
And scanning the database twice, generating and recursive FP-Tree multiple times also makes the space and time performance of the algorithm not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Maximum frequent item set mining method based on adjacency list
  • Maximum frequent item set mining method based on adjacency list
  • Maximum frequent item set mining method based on adjacency list

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0044]

[0045]

[0046]

[0047]

[0048]

[0049]

[0050]

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a maximum frequent item set mining method based on an adjacency list, and the method employs a storage mode combining the adjacency list and a Hash table to reduce the scanning frequency of a database and the spatial scale of traversal, can trim out an item set smaller than a support degree threshold as early as possible, and avoids the generation of all non-empty subsetsof a longer maximum frequent item set. According to the method, the established adjacency list is fully utilized, the original database only needs to be scanned once, and the method has the advantagesof low time complexity, memory consumption and the like.

Description

Technical field [0001] Maximum frequent itemsets mining is an important issue in many data mining applications. The present invention relates to a maximum frequent itemsets mining technology based on adjacency lists. In the process of studying the maximum frequent itemsets mining, considering the application of adjacency lists as data storage media The data is stored for later mining of the largest frequent itemsets. The technology can be applied in many fields, including health care, education, manufacturing, etc. It can also be used in daily customer relationship mining management, fraud intrusion analysis, online shopping analysis and other aspects, with a very large range of applications And market potential. Background technique [0002] Data mining is the process of extracting potential, unknown and not easy to be discovered intuitively from a large amount of data, and finally can be expressed as understandable knowledge. Association rule mining is one of the important co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/2458G06F16/2455G06F16/22
Inventor 殷茗王文杰蒋丹张煊宇曹宏业穆瑞杨益吴瑜
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products