Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Frequent item set mining method

A technology of frequent itemsets mining and frequent itemsets, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of insufficient mining efficiency of mining results availability, hindering the application of differential privacy protection technology, etc.

Inactive Publication Date: 2016-07-06
BEIJING UNIV OF POSTS & TELECOMM
View PDF0 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the above algorithms have shortcomings in the availability of mining results and mining efficiency, which hinders the application of differential privacy protection technology in frequent pattern mining research.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Frequent item set mining method
  • Frequent item set mining method
  • Frequent item set mining method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0079] The flow of the frequent itemset mining method in the embodiment of the present invention is as follows: figure 1 shown, including the following steps:

[0080] Step S1: Preprocessing the original database. Using the intelligent segmentation method, the transaction in the original database whose length is greater than the specified limit length is divided into multiple sub-transactions, so that the length of each transaction in the transformed database is not greater than the specified limit length.

[0081] Step S2: Mining frequent itemsets under the premise of satisfying differential privacy protection. Mining frequent itemsets in the transformed database accor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of data mining and data privacy, and discloses a frequent item set mining method. The frequent item set mining method comprises the following steps: S1: segmenting a transaction of which the transaction length is greater than a restriction length in an original database into a plurality of sub-transactions, and causing the length of each transaction in the segmented database to be smaller than or equal to the restriction length; and S2: according to a support degree threshold value which is appointed in advance, utilizing a support degree estimation method and a dynamic descent method to mine the frequent item set in the segmented database. The frequent item set mining method can provide higher mining efficiency and mining result availability while differential privacy protection is met.

Description

technical field [0001] The invention relates to the technical fields of data mining and data privacy, in particular to a frequent item set mining method. Background technique [0002] Frequent itemset mining is a basic problem in the field of data mining, and it has a wide range of applications in many fields. Frequent itemset mining can be described as follows: Given a transaction database, each transaction corresponds to a user's personal record. where a transaction is a collection of items. Given an itemset (a collection of items), its support refers to the number of transactions that contain this item set. When the support of an itemset is not less than a given threshold, the itemset is called a frequent itemset. When a transactional database and a threshold are given, frequent itemset mining is to mine all frequent itemsets that appear in the database. [0003] In frequent itemset mining, the FP-growth algorithm [2] is a widely used mining algorithm. FP-growth alg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 程祥苏森许胜之徐鹏双锴王玉龙张忠宝
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products