Frequent item set mining method and system supporting differential privacy

A technology of frequent itemsets mining and frequent itemsets, which is applied in the fields of instruments, calculations, electrical digital data processing, etc., can solve the problems of waste of resources and low efficiency of mining methods, etc.

Inactive Publication Date: 2017-08-25
HUAZHONG UNIV OF SCI & TECH
View PDF2 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the above defects or improvement needs of the prior art, the present invention provides a frequent itemset mining method and system supporting differential privacy. The inefficiency of the method and the technical problem of wasting resources due to the generation of a large number of useless θ-basis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Frequent item set mining method and system supporting differential privacy
  • Frequent item set mining method and system supporting differential privacy
  • Frequent item set mining method and system supporting differential privacy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0060] Such as figure 1 As shown, the frequent itemset mining method of the present invention supporting differential privacy includes the following steps:

[0061] (1) Receive input data set D={D 1 ,D 2 ,...,D n}, privacy budget ε, and the number of returned frequent itemsets k, according to the input data set D to determine the constant set C = {C 1 , C 2 ,...,C m}, and determine the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a frequent item set mining method and system supporting differential privacy, and belongs to the field of computer data privacy protection and data mining. The method comprises the steps of processing an original data set by adopting a transaction truncation method, and avoiding privacy leakage in the truncation process by using an exponential difference mechanism-based method; then constructing a tree structure, wherein each node corresponds to an item of a transaction, frequent information on the transaction is stored in a node corresponding to the last item of the transaction, and the support degree of each node is initialized by using Laplace noise to avoid the privacy leakage in the tree construction process; and converting the constructed tree into an FP tree, so that a frequent item set can be mined by using an FP-Growth method. According to the method and the system, the released frequent item set can meet security demands of the differential privacy; compared with an original frequent item set, the released frequent item set has relatively high similarity with the original frequent item set and relatively high data availability; and compared with a conventional algorithm, higher efficiency is achieved.

Description

technical field [0001] The invention belongs to the technical field of computer data privacy protection and data mining, and more specifically relates to a frequent item set mining method and system supporting differential privacy. Background technique [0002] With the advent of the era of big data, more and more data are being collected and analyzed by some statistical agencies. Sometimes, these agencies will release some data for use by third parties, and these data may contain some sensitive information of the data collected, such as a patient suffering from a certain disease. Therefore, publishing these data directly will cause the privacy of the data collectors to leak. In order to protect the privacy of the data collectors, the data issuer must take privacy protection measures. [0003] At present, privacy protection is widely used in frequent itemsets mining. The existing frequent itemsets mining method that supports privacy protection is mainly the PrivBasis metho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/62
CPCG06F21/6245G06F21/6227
Inventor 丁晓锋金海陈龙
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products